NOVEL BACTERIAL GENES AND PROTEINS THAT ARE ESSENTIAL FOR
CELL VIABILITY AND THEIR USES 



Throughout this application various publications are referenced. The disclosures of these 
publications in their entireties are hereby incorporated by reference into this application in 
order to more fully describe the state of the art to which this invention pertains. 

FIELD OF THE INVENTION 

The present invention relates generally to nucleotide sequences, and polypeptides 
encoded by the sequences, that are essential for bacterial viability, and to methods of 
using the nucleotide and polypeptide sequences. 

BACKGROUND OF THE INVENTION 

Bacterial genera, such as Streptococcus, Staphylococcus, Pseudomonas, Yersinia, 
Salmonella, and Enterobacter, are the cause of numerous afflictions in humans and 
animals. Bacterial infection can lead to serious health conditions, including pneumonia, 
osteomyelitis, meningitis, sinusitis, otitis, cystitis, and even food poisoning. Typically, 
these infections can be treated with standard antimicrobial agents such as antibiotics. 
However, the emergence of pathogenic bacterial strains that are resistant to antibiotics 
has risen alarmingly in the past two decades. This situation has created an urgent need 
for the development of new antimicrobial agents. 

One strategy for developing new antimicrobial agents is to identify bacterial gene
sequences that encode gene products essential for bacterial cell viability, and then to
develop and/or identify agents that inhibit the function of those gene products. DNA
sequencing technology has advanced from sequencing one gene at a time to sequencing
entire genomes, the sum of all genes in an organism. With the recent arrival of bacterial
genomic information, it is now possible to compare multiple bacterial genomes in an
attempt to identify genes that encode conserved gene products. In this manner, one
skilled in the art may identify a set of conserved bacterial genes, including a subset of
genes that are essential for bacterial cell viability. An essential gene is then used as a
starting point to develop therapeutic agents that inhibit or inactivate the product of the
essential gene.


The availability of DNA sequence information for multiple microbial genomes is a recent
development. The public release of the first complete genome sequence, that of Haemophilus
influenzae (Fleischmann, R.D., et al., 1995 Science 269:496-512), was followed in rapid succession
by a number of public and private genome sequencing programs. Presently, some 20
completely sequenced bacterial genomes have been published, and over 100 other
sequencing projects are underway (Blattner, F.R., et al., 1997 Science 277:1453-74;
Ferretti, J.J., et al., 1997 Adv Exp Med Biol 418:961-963; Koonin, E.V., et al., 1996
Methods Enzymol 266:295-322). Analyses of these data indicate that approximately
46% of putative bacterial genes have no attributable function.


Others have pursued various strategies to identify bacterial genes that are essential for
viability. These strategies include: identifying genes that are expressed by the bacteria
when present in the infected host (Hensel, M., et al., 1995 Science 269:400-3),
identifying essential genes by isolating temperature-sensitive mutants (Schmid, M.B., et
al., 1998 Curr Opin Chem Biol 2:529-34), and identifying genes in pathways known
from prior physiological studies to be essential (Skarzynski, T., et al., 1996 Structure
4:1465-74).

There continues to be a need to identify bacterial genes that encode gene products that are
essential for cell viability, such as cell replication, growth, and survival. These genes and
their encoded gene products can be used as a starting point towards identifying agents
that inhibit functions essential for cell viability, thereby causing bacterial cell stasis or
death (e.g., antibacterial agents).

The present invention provides experimental identification of novel, conserved essential
genes (ceg) from bacteria and their encoded protein products. The ceg genes are
considered essential to cell viability because disruption of an endogenous ceg gene results
in lethality of a bacterial cell (e.g., as determined by failure to recover viable
chloramphenicol-resistant colonies, as described herein). Thus, the gene products
encoded by these genes are potentially valuable targets for chemotherapeutic intervention
of bacterial infections.

The ceg nucleotide sequences of the invention were obtained by large-scale
computational comparisons of multiple genome sequences to identify conserved protein
coding regions, followed by gene disruption to identify cegs. The conservation of protein
sequences in many cases is believed to reflect the higher level conservation of common
biochemical pathways essential for bacterial function and viability.
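
The comparison step described above can be illustrated with a minimal sketch. It assumes that the protein coding regions of each genome have already been grouped into families by some similarity search, which this section does not specify; the genome names, family labels, and the function name conserved_families are hypothetical and are used for illustration only. The sketch covers only the computational filter for conservation, not the gene disruption step that establishes essentiality.

```python
from typing import Dict, Set


def conserved_families(genomes: Dict[str, Set[str]]) -> Set[str]:
    """Return the protein-family labels that are present in every genome."""
    if not genomes:
        return set()
    family_sets = iter(genomes.values())
    # Start from a copy of the first genome's families, then intersect the rest.
    conserved = set(next(family_sets))
    for families in family_sets:
        conserved &= families
    return conserved


# Hypothetical toy input: genome name -> family labels found in that genome.
toy_genomes = {
    "genome_A": {"fam1", "fam2", "fam3"},
    "genome_B": {"fam2", "fam3", "fam4"},
    "genome_C": {"fam3", "fam2", "fam5"},
}

print(sorted(conserved_families(toy_genomes)))
```

Families that survive this filter would only be candidates; under the approach described herein, each candidate would still be tested by gene disruption.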

SUMMARY OF THE INVENTION 

The acronyms "CEG" and "ceg" stand for Conserved Essential Gene. For convenience,
the italicized term ceg refers herein to ceg nucleotide sequences. The capitalized term CEG
refers herein to CEG polypeptide sequences.

Embodiments of the ceg nucleotide sequences and the CEG polypeptide sequences are
designated CFEs, which stands for CEG For Expression. The CFEs are polypeptides
resulting from expression of the ceg nucleotide sequence.

The present invention provides isolated nucleotide sequences of conserved essential
genes from bacteria, designated ceg. The invention also provides recombinant nucleic
acid molecules including the ceg sequences of the invention, and methods of use thereof.
Examples of nucleic acid molecules having ceg sequences are described in SEQ ID
NOS.: 1-113. The invention further provides isolated polypeptides and recombinant
polypeptides having the CEG sequences of the invention, and methods of use thereof.
Examples of polypeptides having CEG sequences are described in SEQ ID NOS.: 114-226.


The ceg sequences of the present invention are DNA or RNA. Further, the invention
includes nucleic acid molecules that are identical or nearly identical (e.g., similar) to
the ceg sequences of the invention. The invention additionally provides polynucleotide
sequences that hybridize under stringent conditions to the ceg sequences of the invention.
A further embodiment provides polynucleotide sequences which are complementary to
the ceg sequences of the invention. Yet another embodiment provides ceg nucleic acid
molecules that are labeled with a detectable marker. Another embodiment provides
recombinant nucleic acid molecules, such as a vector or a fusion molecule, including the
ceg sequences of the invention.
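
As an illustration of what a complementary polynucleotide sequence is, the following minimal sketch computes the reverse complement of a DNA strand using standard Watson-Crick base pairing. The input sequence is an arbitrary example and is not one of the ceg sequences; the function name reverse_complement is chosen here for illustration.

```python
# Translation table for Watson-Crick pairing (A<->T, C<->G), case-preserving.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


# Arbitrary example sequence, not taken from the sequence listing.
print(reverse_complement("ATGGCC"))  # GGCCAT
```

Applying the function twice returns the original sequence, which is a convenient sanity check.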


The present invention provides various ceg sequences, fragments thereof having essential 
gene activity, and related molecules such as antisense molecules, oligonucleotides, 
peptide nucleic acids (PNA), fragments, and portions thereof. 

The present invention relates to the inclusion of the polynucleotides encoding CEG gene 
products, such as CEG polypeptides, in an expression vector which can be used to 
transform host cells or organisms. Such transgenic hosts are useful for the production of 
CEG gene products for the development of antibacterial agents such as antibiotics. 

The invention further provides substantially purified CEG gene products, and uses
thereof.

The invention also relates to pharmaceutical compositions comprising antisense
molecules capable of disrupting expression of ceg sequences, agonists, antagonists or
inhibitors of CEG gene products, and antibodies reactive against the CEG polypeptides.






WO 01/49721 



PCT/US00/35604 



These compositions are useful for preventing the growth or survival of bacteria, for 
example, in the treatment of conditions associated with bacterial infections. 

BRIEF DESCRIPTION OF THE FIGURES 


Figure 1: A schematic representation of the gene disruption assay, as described in Example
3, infra. A) A recombinant vector undergoing homologous recombination with the host 
genome. B) The result of homologous recombination. 

Figure 2: A schematic representation of the polarity test for operons, as described in
Examples 2 and 3, infra. A) The recombinant vector undergoing homologous
recombination with the host genome. B) Case 1: one possible result of homologous
recombination; the downstream Gene B has an independent promoter. C) Case 2: another
possible result of homologous recombination; the downstream Gene B does not have an
independent promoter.

Figure 3: Purification of 2CFE 75, as described in Example 6, infra. A) Fractionation
profile of 2CFE 75 eluted from a Ni-NTA column. B) Gel electrophoresis of pooled
fractions of 2CFE 75. C) Non-denaturing gel electrophoresis to determine the oligomeric
form of 2CFE 75.

Figure 4: Fractionation profile of 2CFE 3 eluted from a hydroxyapatite column, as described 
in Example 7, infra. 

Figure 5: The biosynthesis pathway of Coenzyme A which starts with phosphorylation of
pantothenate. 

Figure 6: Circular dichroism spectra of 2CFE 101 and 103, as described in Example 10,
infra. A) Circular dichroism spectra of 2CFE 101 and 103 at 25 degrees C. B) Circular
dichroism thermal melt spectra of 2CFE 101 and 103 at a range of zero to 100 degrees C.

Figure 7: Circular dichroism spectra of aggregate and monomer pools of 2CFE 101 and 103,
as described in Example 10, infra. A) Circular dichroism spectra of aggregate and monomer
pools of 2CFE 101 and 103 at 25 degrees C. B) Circular dichroism thermal melt spectra of
aggregate and monomer pools of 2CFE 101 and 103 at a range of zero to 100 degrees C.


Figure 8: Absorbance spectra of pantothenate-dependent production of ADP, as described in 
Example 10, infra.

Figure 9: The results of size exclusion chromatography and gel electrophoresis showing the
oligomeric forms of 2CFE 21 and 39, as described in Example 11, infra. Lanes 1-6 contain
2CFE 21, lane 7 is a molecular weight marker, lanes 8-10 contain 2CFE 39.

Figure 10: Gel electrophoresis of a helicase reaction using 2CFE 21 and 39 and radiolabeled
synthetic Holliday Junction template, as described in Example 11, infra. Lane 1 contains
the synthetic Holliday Junction template; lane 2 contains the synthetic duplex; lane 3
contains a single-stranded template; lane 4 contains the helicase reaction using 2CFE 39;
lane 5 contains the helicase reaction using 2CFE 21; lanes 6-8 contain the helicase reaction
using 2CFE 39 and 21 at varying concentrations (e.g., 1, 2, and 3 each); and lane 9
contains the helicase reaction using 2 µM each 2CFE 39 and 21 in the presence of ethidium
bromide.

Figure 11: A graph depicting the results of the helicase reaction, which were monitored by
measuring the unquenching of the Holliday Junction templates with time, as described in
Example 11, infra.


Figure 12: Capillary electrophoresis results of 2CFE 8 with and without ssDNA, as
described in Example 12, infra. A) Electropherogram of 2CFE 8 alone; B) 
Electropherogram of 2CFE 8 in the presence of a 32-nucleotide single-stranded oligomer. 

Figure 13: Gel mobility shift assay of 2CFE 8, and 2CFE 8 in the presence of a
single-stranded 32-mer, as described in Example 12, infra. A) An ethidium bromide-stained,
native, polyacrylamide gel containing 2CFE 8, and 2CFE 8 in the presence of a 32-mer. B)
The same native, polyacrylamide gel stained with Coomassie.

Figure 14: The N-acetyl glucosamine pathway putatively mediated by 2CFE 3 and 2CFE
86, as described in Example 13, infra.

Figure 15: Capillary electrophoresis results of 2CFE 3 with and without putative substrates,
as described in Example 13, infra. A) Electropherogram of 2CFE 3 with and without
glucosamine-1-phosphate. B) Electropherogram of 2CFE 3 with and without
D-glucose-1-phosphate. C) Electropherogram of 2CFE 3 alone, 2CFE 3 and glucose-1-phosphate, and
2CFE 3 and glucose-6-phosphate. D) Electropherogram of 2CFE 3 alone or in the presence
of glucosamine-1-phosphate, glucosamine-6-phosphate, D-glucose, D(+) galactose, and
α-D-glucose-1-phosphate.

Figure 16: Capillary electrophoresis results of FITC-derivatized 2CFE 3 polypeptide with
and without D-glucosamine-6-phosphate (substrate) to produce the product
D-glucosamine-1-phosphate, using laser-induced fluorescence, as described in Example 13, infra.
Electropherogram of D-glucosamine-6-phosphate (putative substrate), 2CFE 3 reacted with
D-glucosamine-6-phosphate, and the product glucosamine-1-phosphate.


Figure 17: Gel electrophoresis of 2CFE 86 eluted from an Ni-NTA column, as described in 
Example 13, infra. 

Figure 18: HPLC analysis of a coupled reaction including 2CFE 3, 2CFE 86, and
D-glucosamine-6-phosphate to produce the product, UDP-N-acetylglucosamine-1-phosphate
(UDPAG), as described in Example 13, infra.

Figure 19: A fatty acid biosynthesis pathway.

Figure 20: Size exclusion chromatography to determine the molecular weight and
oligomeric form of 2CFE 34, as described in Example 14, infra. Selected eluted samples
were sized by gel electrophoresis.

Figure 21: Gel electrophoresis of 2CFE 41 eluted from a Ni-NTA column, as described in
Example 15, infra. 

Figure 22: Capillary electrophoresis results of 2CFE 40, 41, and 46, as described in Example 
15, infra. 

Figure 23: Depicts a schematic diagram of a ligand which binds 2CFE 34. The ligand is
2-phenyl-N-(3-carboxyl-4-hydroxyphenyl) azabicyclo [4.3.0] nona-2, 8-diene.

Figure 24: Depicts a schematic diagram of a ligand which binds 2CFE 43. The ligand is N-
(3, 5-dinitrobenzyl)-7-trifluoromethyl benza diaza furanolactone. 

Figure 25: Depicts a schematic diagram of a ligand which binds 2CFE 43. The ligand is
2-amino (N-para-methylphenyl sulfonamide)-3-phenylpropionic acid.

Figure 26: A nucleic acid sequence of 2CFE1 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 27: A nucleic acid sequence of 2CFE2 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 28: A nucleic acid sequence of 2CFE3 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 29: A nucleic acid sequence of 2CFE4 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 30: A nucleic acid sequence of 2CFE5 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 31: A nucleic acid sequence of 2CFE6 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 32: A nucleic acid sequence of 2CFE7 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 33: A nucleic acid sequence of 2CFE8 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 34: A nucleic acid sequence of 2CFE9 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 35: A nucleic acid sequence of 2CFE10 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 36: A nucleic acid sequence of 2CFE11 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 37: A nucleic acid sequence of 2CFE12 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 


Figure 38: A nucleic acid sequence of 2CFE13 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 39: A nucleic acid sequence of 2CFE14 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 40: A nucleic acid sequence of 2CFE15 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 41: A nucleic acid sequence of 2CFE16 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 42: A nucleic acid sequence of 2CFE17 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 43: A nucleic acid sequence of 2CFE19 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 44: A nucleic acid sequence of 2CFE21 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 


Figure 45: A nucleic acid sequence of 2CFE24 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 46: A nucleic acid sequence of 2CFE25 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000.

Figure 47: A nucleic acid sequence of 2CFE26 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 48: A nucleic acid sequence of 2CFE27 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 49: A nucleic acid sequence of 2CFE28 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 50: A nucleic acid sequence of 2CFE29 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 51: A nucleic acid sequence of 2CFE30 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 52: A nucleic acid sequence of 2CFE31 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 53: A nucleic acid sequence of 2CFE32 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 54: A nucleic acid sequence of 2CFE33 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 


Figure 55: A nucleic acid sequence of 2CFE34 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 56: A nucleic acid sequence of 2CFE35 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000.

Figure 57: A nucleic acid sequence of 2CFE36 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 58: A nucleic acid sequence of 2CFE37 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 59: A nucleic acid sequence of 2CFE38 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 60: A nucleic acid sequence of 2CFE39 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 61: A nucleic acid sequence of 2CFE40 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 62: A nucleic acid sequence of 2CFE41 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 63: A nucleic acid sequence of 2CFE42 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 64: A nucleic acid sequence of 2CFE43 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000.

Figure 65: A nucleic acid sequence of 2CFE44 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 66: A nucleic acid sequence of 2CFE45 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000.

Figure 67: A nucleic acid sequence of 2CFE46 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000.

Figure 68: A nucleic acid sequence of 2CFE47 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 69: A nucleic acid sequence of 2CFE48 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 70: A nucleic acid sequence of 2CFE49 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 71: A nucleic acid sequence of 2CFE50 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 72: A nucleic acid sequence of 2CFE51 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 73: A nucleic acid sequence of 2CFE52 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 74: A nucleic acid sequence of 2CFE53 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 


Figure 75: A nucleic acid sequence of 2CFE54 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 76: A nucleic acid sequence of 2CFE55 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000.

Figure 77: A nucleic acid sequence of 2CFE56 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 78: A nucleic acid sequence of 2CFE57 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 79: A nucleic acid sequence of 2CFE58 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 80: A nucleic acid sequence of 2CFE59 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 81: A nucleic acid sequence of 2CFE60 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 82: A nucleic acid sequence of 2CFE61 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 83: A nucleic acid sequence of 2CFE62 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 84: A nucleic acid sequence of 2CFE64 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 85: A nucleic acid sequence of 2CFE65 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000.

Figure 86: A nucleic acid sequence of 2CFE66 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 87: A nucleic acid sequence of 2CFE67 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 88: A nucleic acid sequence of 2CFE68 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 89: A nucleic acid sequence of 2CFE69 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 90: A nucleic acid sequence of 2CFE70 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 91: A nucleic acid sequence of 2CFE71 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 92: A nucleic acid sequence of 2CFE72 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 93: A nucleic acid sequence of 2CFE75 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 94: A nucleic acid sequence of 2CFE76 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 95: A nucleic acid sequence of 2CFE78 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 96: A nucleic acid sequence of 2CFE79 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 97: A nucleic acid sequence of 2CFE80 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 98: A nucleic acid sequence of 2CFE81 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 99: A nucleic acid sequence of 2CFE82 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 




Figure 100: A nucleic acid sequence of 2CFE83 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 101: A nucleic acid sequence of 2CFE84 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 102: A nucleic acid sequence of 2CFE85 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 103: A nucleic acid sequence of 2CFE86 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 104: A nucleic acid sequence of 2CFE87 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 


Figure 105: A nucleic acid sequence of 2CFE88 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 106: A nucleic acid sequence of 2CFE89 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000.

Figure 107: A nucleic acid sequence of 2CFE90 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 108: A nucleic acid sequence of 2CFE91 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000. 

Figure 109: A nucleic acid sequence of 2CFE92 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 110: A nucleic acid sequence of 2CFE94 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 111: A nucleic acid sequence of 2CFE95 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 112: A nucleic acid sequence of 2CFE96 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 113: A nucleic acid sequence of 2CFE97 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 114: A nucleic acid sequence of 2CFE99 deposited with the American Type Culture
Collection as ATCC designation on December 20, 2000.

Figure 115: A nucleic acid sequence of 2CFE101 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 116: A nucleic acid sequence of 2CFE102 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 117: A nucleic acid sequence of 2CFE103 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 118: A nucleic acid sequence of 2CFE104 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 119: A nucleic acid sequence of 2CFE105 deposited with the American Type
Culture Collection as ATCC designation on December 20, 2000.

Figure 120: A nucleic acid sequence of 2CFE106 deposited with the American Type
Culture Collection as ATCC designation on December 20, 2000. 

Figure 121: A nucleic acid sequence of 2CFE107 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 122: A nucleic acid sequence of 2CFE108 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 123: A nucleic acid sequence of 2CFE109 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 124: A nucleic acid sequence of 2CFE111 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 125: A nucleic acid sequence of 2CFE112 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 126: A nucleic acid sequence of 2CFE113 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 127: A nucleic acid sequence of 2CFE114 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 128: A nucleic acid sequence of 2CFE115 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 129: A nucleic acid sequence of 2CFE116 deposited with the American Type
Culture Collection as ATCC designation on December 20, 2000.

Figure 130: A nucleic acid sequence of 2CFE117 deposited with the American Type
Culture Collection as ATCC designation on December 20, 2000.

Figure 131: Schematic structures of alkaloids which are ligands, for example, of 2CFE42.

DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

All scientific and technical terms used in this application have meanings commonly used in
the art unless otherwise specified. As used in this application, the following words or
phrases have the meanings specified.

As used herein, a ceg nucleic acid molecule is said to be "isolated" when the nucleic acid 
molecule is substantially separated from contaminant nucleic acid molecules that encode 
polypeptides other than CEGs. Additionally, isolated nucleic acid molecule refers to any 
RNA or DNA sequence obtained from a natural source, or constructed by recombinant 
methods, or synthesized. A skilled artisan can readily employ nucleic acid isolation 
procedures to obtain an isolated nucleic acid molecule having ceg sequences. 

The term "ceg" includes all isolated forms of ceg nucleotide and CEG amino acid sequences 
disclosed herein. The ceg sequences encode gene products that have essential biological 
functions in bacterial cells, such as, for example, nucleotide biosynthesis, amino acid 
biosynthesis, DNA replication, RNA transcription, protein translation, DNA 
recombination, DNA repair, biosynthesis of cofactors (e.g., Coenzyme A), biosynthesis 
of prosthetic groups, cellular processes (e.g., chaperones, cell division, and polypeptide 
secretion), energy metabolism (e.g., pentose phosphate pathway, glycolysis, 
gluconeogenesis), fatty acid biosynthesis, cell wall biosynthesis, and/or biosynthesis of 
purines, pyrimidines, nucleosides, and nucleotides. Accordingly, the gene products of the 
ceg nucleotide sequences are required for viability of bacterial cells. The term "ceg" also
includes variants having nucleotide sequence similarity to the disclosed ceg sequences,
including sequences isolated from various bacterial genera and species, allelic variants,
mutant variants, and ceg variants that encode conservative and non-conservative amino acid 
substitutions. The present invention also provides for all ceg sequences generated by 
recombinant DNA technology, including complementary sequences, ceg sequences that 
hybridize to the sequences of the invention at high stringency hybridization conditions, 
fusion genes comprising a ceg sequence, and codon usage variants. 

The term "essential gene" refers to a nucleotide sequence that encodes a gene product 
having a function which is required for cell viability. The term "essential protein" refers 
to a polypeptide that is encoded by an essential gene and has a function that is required 
for cell viability. Accordingly, a mutation that disrupts the function of the essential gene 
or essential protein results in a loss of viability of cells harboring the mutation. 

"Non-essential genes" or "non-essential proteins" refer to genomic information or the 
protein(s) or RNAs encoded therefrom which, when disrupted by a mutation, do not 
result in a loss of viability of cells harboring said mutation under defined laboratory 
conditions. 

As used herein, a nucleotide sequence is said to be "identical" to another reference 
sequence when both nucleotide sequences are exactly alike. 

As used herein, a nucleotide sequence is said to be "similar" to another reference 
sequence when a comparison of the two sequences shows that they have a low level of 
sequence differences. For example, two sequences are considered to be similar to each 
other when the percentage of nucleotides that are shared between the two sequences is 
between about 70% and 99.99% over the entire length of the two sequences. 

As used herein, an amino acid sequence is said to be "similar" to another reference 
sequence when a comparison of the two sequences shows that they have a low level of 
sequence differences. For example, two sequences are considered to be similar to each 
other when the percentage of amino acids that are shared between the two sequences is 
between about 30% and 100% over the entire length of the two sequences. 
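
By way of non-limiting illustration, the percentage of shared residues referred to in the definitions above may be computed as sketched below. The sketch assumes that the two sequences have already been aligned to equal length. The function names, and the choice to report 100% as "identical" for both sequence types, are illustrative assumptions and do not form part of the disclosure.

```python
# Lower bounds taken from the definitions above (about 70% for nucleotide
# sequences, about 30% for amino acid sequences).
SIMILARITY_FLOOR = {"nucleotide": 70.0, "amino_acid": 30.0}


def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions that match in two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


def classify_pair(seq_a: str, seq_b: str, kind: str = "nucleotide") -> str:
    """Label a pair as 'identical', 'similar', or 'not similar' (hypothetical helper)."""
    identity = percent_identity(seq_a, seq_b)
    if identity == 100.0:
        return "identical"
    return "similar" if identity >= SIMILARITY_FLOOR[kind] else "not similar"


if __name__ == "__main__":
    print(percent_identity("ATGCATGCAT", "ATGCATGCAA"))
    print(classify_pair("ATGCATGCAT", "ATGCATGCAA"))
```

A production comparison would first perform a sequence alignment; that step is omitted here for brevity.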

As used herein, an "allele" or "allelic sequence" is an alternative form of the 
naturally-occurring ceg sequence. Alleles result from a mutation that changes the nucleotide 
sequence, and generally produce altered mRNAs or polypeptides whose structure or 
function may or may not be altered. 

"Substantially purified" as used herein means a specific isolated nucleic acid or protein, 
or fragment thereof, in which substantially all contaminants (i.e., substances that differ 
from said specific molecule) have been separated from said nucleic acid or protein. 

In a host cell, an "endogenous" sequence as used herein means a nucleic acid sequence 
that is naturally-occurring and resides within the host genome. 

In a host cell, an "exogenous" sequence as used herein means an isolated nucleic acid 
sequence that is introduced into the host cell, using any one of a variety of introduction 
methods, such as transfection, electroporation, cationic lipid or salt treatment methods. 

"Knockout mutant" or "knockout mutation" as used herein refers to an in vitro engineered 
disruption of a region of endogenous chromosomal DNA (e.g., disruption of the genome), 
typically within a protein coding region. A knockout mutation can be generated by 
inserting an exogenous DNA sequence into the homologous endogenous sequence. A 
knockout mutation occurring in a protein coding region is expected to disrupt normal 
expression of the protein coding region. This usually leads to loss of the function 
provided by the protein. 

In order that the invention herein described may be more fully understood, the following 
description is set forth. 

A) MOLECULES OF THE INVENTION 

1.) CEG NUCLEIC ACID MOLECULES 

The present invention provides isolated and recombinant ceg nucleic acid molecules and 
fragments thereof, and related molecules, such as sequences complementary to ceg 
sequences or a portion thereof, and those that hybridize to the nucleic acid molecules of 
the invention. 

The ceg polynucleotide sequences, also referred to herein as nucleic acid molecules of the 
invention, are preferably in isolated form, including DNA, RNA, DNA/RNA hybrids, and 
related molecules, and fragments thereof. Specifically contemplated are genomic DNA, 
ribozymes, and antisense molecules, as well as nucleic acid molecules based on an 
alternative backbone or including alternative bases, whether derived from natural sources or 
synthesized. Embodiments of particular ceg polynucleotide and amino acid sequences 
include, but are not limited to, the sequences described in Tables I and II (e.g., SEQ ID 
NOS:1-113, 114-226 and SEQ ID NOS: 227-339, 340-452, respectively). The ceg 
polynucleotide and amino acid sequences were designated cfe, which stands for CEG For 
Expression. 

Biological samples of the 2CFE nucleic acid molecules (e.g., SEQ ID NOS: 227-331) 
were deposited on December 20, 2000 with the American Type Culture Collection 
(ATCC), 10801 University Blvd., Manassas, VA 20110-2209. 

a) Variant ceg Nucleotide Sequences 

The present invention also provides nucleic acid molecules having a nucleotide sequence 
substantially identical or similar to the ceg sequences (SEQ ID NOS: 1-113, 227-331) 
disclosed herein. 

The present invention provides nucleotide sequences which are similar to SEQ ID 
NOS:1-113 and/or SEQ ID NOS:227-331. The present invention provides nucleotide 
sequences which vary from SEQ ID NOS: 1-113 or 227-331 by a range of about 1% to 
about 70%. 

The present invention encompasses variations in polynucleotide sequences resulting from 
mutations and/or from transfer of genetic material from one cell to another (e.g., 
horizontal gene transfer or horizontal gene exchange). 

The present invention also provides for variants of the polynucleotide ceg sequences 
disclosed herein, including variants isolated from naturally-occurring sources and those 
generated by recombinant DNA technology or other in vitro synthesis methodologies 
(e.g., PCR). The variant polynucleotide sequences of the invention encode polypeptides 
that exhibit the biological activity of naturally-occurring CEG polypeptides, such as 
activity required for bacterial cell viability. 

In general, a variant of a ceg polynucleotide sequence may encode a 
polypeptide that differs by one or more amino acid substitutions. The variant may have 
conservative changes, wherein a substituted amino acid has similar structural or chemical 
properties, e.g., replacement of leucine with isoleucine. 

A polynucleotide sequence can encode conservative amino acid substitutions without 
altering either the conformation or the function of the polypeptide. Such changes include 
substituting any of isoleucine (I), valine (V), and leucine (L) for any other of these 
hydrophobic amino acids; aspartic acid (D) for glutamic acid (E) and vice versa; 
glutamine (Q) for asparagine (N) and vice versa; and serine (S) for threonine (T) and vice 
versa. Other substitutions can also be considered conservative, depending on the 
environment of the particular amino acid and its role in the three-dimensional structure of 
the protein. For example, glycine (G) and alanine (A) can frequently be interchangeable, 
as can alanine (A) and valine (V). Methionine (M), which is relatively hydrophobic, can 
frequently be interchanged with leucine and isoleucine, and sometimes with valine. 
Lysine (K) and arginine (R) are frequently interchangeable in locations in which the 
significant feature of the amino acid residue is its charge and the differing pK's of these 
two amino acid residues are not significant. Still other changes can be considered 
"conservative" in particular environments. 

A variant may also have nonconservative changes, e.g., replacement of a glycine with a 
tryptophan. Other variations may also include amino acid deletions or insertions, or both. 
Guidance in determining which and how many amino acid residues may be substituted, 
inserted or deleted without abolishing biological or immunological activity may be found 
using computer programs well known in the art, for example, DNASTAR software. 
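
By way of non-limiting illustration, the substitution classes described above may be encoded as a simple lookup, as sketched below. The grouping into "conservative" and "context-dependent" classes follows the residues named in the text; the function name, the class labels, and the treatment of every other change as "non-conservative" are illustrative assumptions only.

```python
# Substitutions the text describes as conservative without qualification.
CONSERVATIVE_GROUPS = [
    frozenset("IVL"),  # isoleucine, valine, leucine
    frozenset("DE"),   # aspartic acid, glutamic acid
    frozenset("QN"),   # glutamine, asparagine
    frozenset("ST"),   # serine, threonine
]

# Substitutions the text describes as conservative only in some environments.
CONTEXT_DEPENDENT_PAIRS = [
    frozenset("GA"), frozenset("AV"),
    frozenset("ML"), frozenset("MI"), frozenset("MV"),
    frozenset("KR"),
]


def classify_substitution(original: str, replacement: str) -> str:
    """Classify a single-residue change using one-letter amino acid codes."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return "unchanged"
    if any(a in group and b in group for group in CONSERVATIVE_GROUPS):
        return "conservative"
    if any(a in pair and b in pair for pair in CONTEXT_DEPENDENT_PAIRS):
        return "context-dependent"
    return "non-conservative"


if __name__ == "__main__":
    for pair in [("L", "I"), ("K", "R"), ("G", "W")]:
        print(pair, classify_substitution(*pair))
```

Such a lookup does not consider the three-dimensional environment of the residue, which the text identifies as relevant to whether a given change is in fact conservative.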

Another type of ceg sequence variant includes naturally-occurring allelic variants of ceg 
which share significant similarity (e.g., between about 30-99%) to the disclosed CEG 
polypeptide sequence. Allelic variants of the ceg sequences can encode conservative or 
non-conservative amino acid substitutions of the CEG polypeptide sequence herein 
described. 

Examples of allelic variants of ceg are mutant alleles of ceg polynucleotide sequences that 
encode a polypeptide having one or more changes in the polypeptide sequence, such as 
amino acid substitutions, deletions, insertions, frame shifts, or truncations. The mutant 
alleles of ceg may or may not encode a CEG polypeptide having the same biological 
functions as wild-type CEG proteins. 

Variations in the bacterial genomic sequences can also arise from transfer of genetic 
material to another bacterial cell. The transfer of gene sequences can occur intraspecies 
or interspecies. Gene transfer can occur between bacterial cells which are members of 
the same or different populations. A population includes, but is not limited to, a serotype 
isolate, a clinical isolate, a naturally-occurring isolate, a strain, and a species. The 
transfer of genetic material can occur between cells within a population; for example, 
transfer from serotype A to serotype A, or between S. pneumoniae and S. 
pneumoniae. The transfer of genetic material can occur between cells of different 
populations; for example, between serotype A and serotype B or S. pneumoniae and S. 
mutans. 

Gene transfer can give rise to mutant or polymorphic variant gene sequences. In rare 
cases, gene transfer introduces new gene sequences that confer a new phenotype, such as 
antibiotic resistance. The transfer of genetic material includes transfer of large regions of 
genomic sequences which include partial gene sequences, whole single gene sequences, 
or multiple gene sequences. This mode of transfer can give rise to replacement of native 
whole gene sequences or introduction of new sequences in the recipient cell. This mode 
of transfer gives rise to mosaic gene sequences in the recipient cell. 

The variation of genomic sequences resulting from gene transfer can be examined using 
molecular techniques, including: multilocus enzyme electrophoresis (Selander, R. K., et 
al., 1986 Appl. Environ. Microbiol. 51:837-884); restriction endonuclease cleavage 
electrophoretic profiling (Coffey, T. J., et al., 1991 Mol. Microbiol. 5:2255-2260); pulsed-field 
gel electrophoresis fingerprinting (Bygraves, J. A. and Maiden, M. C. J. 1992 J. 
Gen. Microbiol. 138:523-531); and ribotyping (Stull, T. L., et al., 1988 J. Infect. Dis. 
157:280-286). The degree of variation can vary greatly, and ranges from little or no 
variation, as exemplified by gene sequences of E. coli (Caugant, D. A., et al., 1981 
Genetics 98:467-490; Whittam, T. S., et al., 1983 Mol. Biol. Evol. 1:67-83; Souza, V., et 
al., 1992 Proc. Natl. Acad. Sci. USA 89:8389-8393) and Salmonella (Selander, R. K., et 
al., 1990 Infect. Immun. 58:2262-2275; Selander, R. K. and Smith, N. H. 1990 Rev. Med. 
Microbiol. 1:219-228; Smith, J. M., et al., 1993 Proc. Natl. Acad. Sci. USA 90:4384-4388), 
to extensive gene transfer in Neisseria gonorrhoeae (Smith, J. M., et al., 1993 
Proc. Natl. Acad. Sci. USA 90:4384-4388). 

Gene transfer can be examined between various isolates of a particular microbial species 
which are antibiotic-sensitive or antibiotic-resistant (Coffey, T. J., et al., 1991 Mol. 
Microbiol. 5:2255-2260). Molecular biology techniques can be utilized to study the 
degree of transfer between populations, such as, for example, the degree of gene transfer 
between serotypes, isolates, strains, or species. The degree of transfer can be examined 
by comparing, for example, the penicillin binding proteins and numerous different loci 
which encode metabolic enzymes or capsular biosynthesis enzymes. 

For example, intra-species, inter-serotype gene transfer is possible (Coffey, T. J., et al., 
1991 supra). Additionally, intraspecies gene transfer in S. pneumoniae (Coffey, T. J., et 
al., 1998 Mol. Microbiol. 27:73-83), Vibrio cholerae (Bik, E. M., et al., 1995 EMBO J. 
14:209-216), and Haemophilus influenzae (Kroll, J. S. and Moxon, E. R. 1990 J. 
Bacteriol. 172:1374-1379) is possible. 

Interspecies gene transfer is also possible (Dowson, C. G., et al., 1989 Proc. Natl. Acad. 
Sci. USA 86:8842-8846; Laibl, G., et al., 1991 Mol. Microbiol. 5:1993-2002; Bourgoin, 
F., et al., 1999 Gene 233:151-161). 

Variant gene sequences arising from gene transfer can be continually generated in 
transformable bacteria (e.g., transformation competent), such as S. pneumoniae. For 
example, the worldwide spread of varying degrees of antibiotic resistance has been 
documented and reviewed (Dowson, C. G., et al., 1994 Trends Microbiol. 2:361-366; 
Spratt, B. G. in Bacterial Cell Wall, eds. Ghuysen J-M. and Hakenbeck, R. 1994 pp. 517-534; 
and reviewed in Maiden, M. C. J. 1998 Clin. Infect. Dis. 27 (Supplement 1) S12-S20). 
For example, variant gene sequences arising from gene transfer can be tracked 
using a marker gene such as the gene which encodes the penicillin binding protein 
(Barcus, V. A., et al., 1995 FEMS Microbiol. Lett. 126:299-303). At the nucleotide level, 
gene sequences encoding the penicillin binding proteins in susceptible and resistant 
strains differ by about 14% to 23% (Hakenbeck, R. 1995 Biochem. Pharmacol. 50:1121-1127; 
Spratt, B. G. in Bacterial Cell Wall, eds. Ghuysen J-M. and Hakenbeck, R. 1994 pp. 
517-534; Spratt, B. G., et al., 1991 Neisseria meningitidis and Streptococcus pneumoniae 
eds. Camisi, I, et al., pp. 73-83; Coffey, T. J., et al., 1995 Micro. Drug Resist. 1:29-34). 

The ceg nucleotide sequences can be isolated from various species of Streptococcus 
including Streptococcus pneumoniae. Additionally, the ceg sequences can be isolated from 
other streptococcal species, including S. mutans, S. pyogenes, and S. thermophilus. The ceg 
polynucleotide sequences can also be isolated from strains of other bacterial genera 
including, but not limited to, Streptococcus, Escherichia, Bacillus, Pseudomonas, 
Yersinia, Salmonella, and Haemophilus. 

The present invention additionally provides isolated codon-usage variants that differ from 
the disclosed ceg nucleotide sequences, yet do not alter the predicted CEG polypeptide 
sequence or function. The codon-usage variants may be generated by recombinant DNA 
technology. Codons may be selected to optimize the level of production of the ceg 
transcript or CEG polypeptide in a particular prokaryotic or eukaryotic expression host, 
in accordance with the frequency with which each codon is utilized by the host cell. Alternative reasons 
for altering the nucleotide sequence encoding a CEG polypeptide include the production 
of RNA transcripts having more desirable properties, such as an extended half-life or 
increased stability. A multitude of variant ceg nucleotide sequences that encode the 
respective CEG polypeptide may be isolated, as a result of the degeneracy of the genetic 
code. Accordingly, the present invention contemplates selecting every possible triplet 
codon to generate every possible combination of nucleotide sequences that encode the 
disclosed CEG polypeptides. This particular embodiment provides isolated nucleotide 
sequences that vary from the sequences as described in SEQ ID NOs.: 1-113 or 227-331, 
such that each variant nucleotide sequence encodes a polypeptide having sequence 
identity with the amino acid sequences, as described in SEQ ID NOs.: 114-226 or 332-436, 
respectively. 
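
By way of non-limiting illustration, the degeneracy of the genetic code referred to above may be explored as sketched below. The sketch assumes the standard genetic code, and the short peptide used in the example is hypothetical and is not one of the disclosed CEG sequences. The function names are likewise illustrative.

```python
from itertools import product
from math import prod

# Standard genetic code, listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Reverse lookup: amino acid -> list of synonymous codons.
SYNONYMS = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def count_codon_variants(peptide: str) -> int:
    """Number of distinct nucleotide sequences that encode the peptide."""
    return prod(len(SYNONYMS[aa]) for aa in peptide)


def codon_variants(peptide: str):
    """Yield every nucleotide sequence that encodes the peptide."""
    for codons in product(*(SYNONYMS[aa] for aa in peptide)):
        yield "".join(codons)


if __name__ == "__main__":
    peptide = "MKL"  # hypothetical three-residue example
    print(count_codon_variants(peptide))
    print(sorted(codon_variants(peptide)))
```

Because the number of variants grows multiplicatively with peptide length, exhaustive enumeration is practical only for short peptides; for a full-length polypeptide the count alone is generally the useful quantity.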

b) Complementary Sequences 

The present invention includes polynucleotide sequences that are complementary to the 
sequences disclosed herein. The term "complementary" as used herein refers to the 
capacity of purine and/or pyrimidine nucleotides to associate through hydrogen bonding 
to form double stranded nucleic acid molecules. The following base pairs are related by 
complementarity: guanine and cytosine; adenine and thymine; and adenine and uracil. 
Complementary applies to all base pairs comprising at least two single-stranded nucleic 
acid molecules. 
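
By way of non-limiting illustration, the base-pairing rules stated above (guanine with cytosine, adenine with thymine, and adenine with uracil) may be applied to derive a complementary strand as sketched below. The function names are illustrative, and the example sequence is hypothetical.

```python
# DNA:DNA pairing (A-T, G-C) and DNA:RNA pairing (A-U, T-A, G-C).
DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")
RNA_COMPLEMENT = str.maketrans("ACGTacgt", "UGCAugca")


def reverse_complement(dna: str) -> str:
    """Complementary DNA strand, written 5' to 3'."""
    return dna.translate(DNA_COMPLEMENT)[::-1]


def complementary_rna(dna: str) -> str:
    """RNA strand complementary to a DNA strand, written 5' to 3'."""
    return dna.translate(RNA_COMPLEMENT)[::-1]


def are_complementary(strand_a: str, strand_b: str) -> bool:
    """True when two DNA strands pair exactly along their full length."""
    return reverse_complement(strand_a).upper() == strand_b.upper()


if __name__ == "__main__":
    print(reverse_complement("ATGC"))
    print(complementary_rna("ATGC"))
```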

c) Sequences Capable of Hybridizing 

Another embodiment provides nucleic acid molecules that will hybridize to ceg 
sequences under hybridization conditions. It is readily apparent to one skilled in the art 
that the stringency of the hybridization condition selected will depend upon the 
characteristics of the nucleic acid molecule to be hybridized, such as, the length, the 
degree of complementarity (e.g., exact or non-exact complementarity), the percent A/T 
content, and the objective of the hybridization experiment. 

The hybridization procedure may be performed under low stringency hybridization 
conditions. Low stringency hybridization conditions will permit hybridization between 
two nucleic acid molecules that differ from exact complementarity by about 25% to 70%. 
Hybridization under standard high stringency conditions will occur between two 
complementary nucleic acid molecules (e.g., 100% exact complementarity) or two 
complementary nucleic acid molecules that differ from exact complementarity by about 
1% to about 70%. 

The high stringency hybridization conditions that disfavor non-homologous base pairing 
are well known in the art. Typically, high stringency hybridization conditions include, 
but are not limited to, hybridizing at 50 °C to 65 °C in 5X SSPE, and washing at 50 °C to 
65 °C in 0.5X SSPE. Typically, low stringency conditions include, but are not limited to, 
hybridizing at 35 °C to 37 °C in 5X SSPE and 40% to 45% formamide and washing at 42 
°C in 1-2X SSPE. The conditions and formulas for high stringency hybridization 
methods are well known in the art and can be readily obtained in Molecular Cloning: A 
Laboratory Manual (2nd edition, Sambrook, Fritsch, and Maniatis 1989, Cold Spring 
Harbor Press) or in Short Protocols in Molecular Biology (Ausubel, F. M., et al., 1989, 
John Wiley & Sons). 

d) Fragments of ceg Sequences 

The invention further provides nucleic acid molecules having fragments of the ceg 
sequences, such as a portion of the ceg sequence (e.g., SEQ ID NOS:1-113, 227-331) 
disclosed herein. The size of the fragment will be determined by its intended use. For 
example, the length of the fragment to be used as a nucleic acid probe or PCR primer is 
chosen to obtain a relatively small number of false positives during probing or priming. 
Alternatively, a fragment of the ceg sequence may be used to construct a recombinant fusion 
gene having a ceg sequence fused to a non-ceg sequence. 

The nucleic acid molecules, fragments thereof, and probes and primers of the present 
invention are useful for a variety of molecular biology techniques including, for example, 
hybridization screens of libraries, or detection and quantification of mRNA transcripts as 
a means for analysis of gene transcription and/or expression. Preferably, the probes and 
primers are DNA. A probe or primer length of at least 15 base pairs is suggested by 
theoretical and practical considerations (Wallace, B. and Miyada, G. 1987 
"Oligonucleotide Probes for the Screening of Recombinant DNA Libraries" in: Methods 
in Enzymology, 152:432-442, Academic Press). Other lengths of fragments, probes, or 
primers are possible and routine to determine. 
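
By way of non-limiting illustration, a candidate probe or primer may be screened against the suggested minimum length of 15 bases as sketched below. The melting temperature estimate uses the common rule of thumb of 2 °C per A or T and 4 °C per G or C for short oligonucleotides; that rule is an assumption introduced here for illustration and is not recited in this application. The function names and the example sequence are hypothetical.

```python
MIN_PROBE_LENGTH = 15  # minimum length suggested in the text


def gc_fraction(oligo: str) -> float:
    """Fraction of G and C bases in the oligonucleotide."""
    seq = oligo.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def estimated_tm_celsius(oligo: str) -> int:
    """Rule-of-thumb melting temperature: 2*(A+T) + 4*(G+C). An assumption,
    generally applied only to short oligonucleotides."""
    seq = oligo.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def meets_minimum_length(oligo: str) -> bool:
    """True when the oligonucleotide has at least 15 bases."""
    return len(oligo) >= MIN_PROBE_LENGTH


if __name__ == "__main__":
    probe = "ATGCATGCATGCATG"  # hypothetical 15-base probe
    print(len(probe), meets_minimum_length(probe), estimated_tm_celsius(probe))
```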

The probes and primers of this invention can be prepared by methods well known to 
those skilled in the art (Sambrook, et al. supra). In a preferred embodiment the probes 
and primers are synthesized by chemical synthesis methods (ed: Gait, M. J. 1984 
Oligonucleotide Synthesis, IRL Press, Oxford, England). 

One embodiment of the present invention provides nucleic acid primers that are 
complementary to ceg sequences, which allow the specific amplification of nucleic acid 
molecules of the invention or of any specific parts thereof. Another embodiment 
provides nucleic acid probes that are complementary to, and selectively or specifically 
hybridize to, the ceg sequences or any part thereof. 

e) Derivative Nucleic Acid Molecules 

The nucleic acid molecules of the invention include peptide nucleic acids (PNAs), or 
derivative molecules such as phosphorothioate, phosphotriester, phosphoramidate, and 
methylphosphonate, that specifically bind to single-stranded DNA or RNA in a base pair-dependent 
manner (Zamecnik, P. C., et al., 1978 Proc. Natl. Acad. Sci. 75:280-284; 
Goodchild, P. C., et al., 1986 Proc. Natl. Acad. Sci. 83:4143-4146). 

PNA molecules comprise a nucleic acid oligomer to which an amino acid residue, such as 
lysine, and an amino group have been added. These small molecules, also designated 
anti-gene agents, stop transcript elongation by binding to their complementary (template) 
strand of nucleic acid (Nielsen, P. E., et al., 1993 Anticancer Drug Des. 8:53-63). For 
example, reviews of methods for synthesis of DNA, RNA, and their analogues can be 
found in: Oligonucleotides and Analogues, ed. F. Eckstein, 1991, IRL Press, New York; 
Oligonucleotide Synthesis, ed. M. J. Gait, 1984, IRL Press, Oxford, England. 

Additionally, methods for antisense RNA technology are described in U.S. patents 
5,194,428 and 5,110,802. A skilled artisan can readily obtain these classes of nucleic acid 
molecules using the herein described ceg polynucleotide sequences; see, for example, 
Innovative and Perspectives in Solid Phase Synthesis (1992) Egholm, et al. pp 325-328 or 
U.S. Patent No. 5,539,082. 

f) RNA Molecules 

The present invention provides RNA molecules that encode the predicted ceg gene 
products. In particular, the RNA molecules of the invention may be isolated full-length 
or partial mRNA molecules or RNA oligomers that encode CEG gene products. The 
RNA molecules of the invention include the nucleotide sequences encoding all or 
portions of CEGs. 

The RNA molecules of the invention also include antisense RNA molecules, peptide 
nucleic acids (PNAs), or non-nucleic acid molecules such as phosphorothioate 
derivatives, that specifically bind to the sense strand of DNA or RNA in a base pair-dependent 
manner. A skilled artisan can readily obtain these classes of nucleic acid 
molecules using the herein described ceg sequences. 

g) Labeled Nucleic Acid Molecules 

The nucleic acid molecules having ceg sequences can be labeled with a detectable 
marker. Examples of a detectable marker include, but are not limited to, a radioisotope, a 
fluorescent compound, a bioluminescent compound, a chemiluminescent compound, a 
metal chelator or an enzyme. Technologies for generating labeled DNA and RNA probes 
are well known in the art (See e.g. Sambrook et al., supra). 

2.) RECOMBINANT NUCLEIC ACID MOLECULES 

Also provided are recombinant nucleic acid molecules, such as recombinant DNA molecules 
(rDNAs) that comprise ceg sequences or fragments thereof. As used herein, a recombinant 
DNA molecule is a DNA molecule that has been subjected to molecular manipulation in vitro. 
Methods for generating rDNA molecules are well known in the art, for example, see Sambrook 
et al., Molecular Cloning (1989), supra. 

a) Vectors 

The nucleic acid molecules of the invention may be recombinant molecules each 
comprising the sequence, or portions thereof, of a ceg sequence linked to a non-ceg 
sequence. For example, the ceg sequence may be fused operatively to a vector to 
generate a recombinant molecule. The term vector includes, but is not limited to, 
plasmids, cosmids, and phagemids. A preferred vector includes an autonomously 
replicating vector comprising a replicon that directs the replication of the rDNA within the 
appropriate host cell. The preferred vectors can also include an expression control 
element, such as a promoter sequence, which enables transcription of the inserted ceg 
sequences and can be used for regulating the expression (e.g., transcription and/or 
translation) of an operably linked ceg sequence in an appropriate host cell such as 
Escherichia coli. Expression control elements are known in the art and include, but are not 
limited to, inducible promoters, constitutive promoters, secretion signals, enhancers, 
transcription terminators, and other transcriptional regulatory elements. Other expression 
control elements that are involved in translation are known in the art, and include the 
Shine-Dalgarno sequence, and initiation and termination codons. The preferred vector also 
includes at least one selectable marker gene that encodes a gene product that confers drug 
resistance such as resistance to ampicillin or tetracycline. The vector also comprises 
multiple endonuclease restriction sites that enable convenient insertion of exogenous 
DNA sequences. 

The preferred vectors for generating ceg transcripts and/or the encoded CEG polypeptides 
are expression vectors which are compatible with prokaryotic host cells. Prokaryotic cell 
expression vectors are well known in the art and are available from several commercial 
sources. For example, a pET vector (e.g., pET-21, Novagen Corp.) may be used to 
express CEG polypeptides in bacterial host cells. 

b) Recombinant Vectors for Integration 

The present invention provides recombinant vectors that may be used to integrate 
exogenously provided sequences into the genome of a host cell. The recombinant 
integration vectors of the present invention include a gene that encodes a selectable 
marker and ceg sequences, or fragments thereof. The integration vectors are used to 
integrate the ceg sequence into a target gene sequence that resides within the bacterial 
host genome (e.g., endogenous sequence), thereby disrupting the function of the target 
gene sequence within the bacterial cells. These integration vectors may be used in a gene 
disruption assay to screen candidate ceg nucleotide sequences, in order to identify the 
candidate sequences that encode a gene product that is required for bacterial cell viability. 

Accordingly, these recombinant integration vectors include candidate ceg sequences that 
will be screened to determine if the candidate ceg sequences encode a gene product that 
is required for cell viability. The candidate ceg sequence that is included as part of the 
recombinant integration vector is the "exogenous" ceg sequence that is employed as the 
"disrupting" sequence in a gene disruption assay. The ceg sequence that resides within 
the host genome is the "endogenous" or "target" ceg sequence. 

The integration event can occur, albeit rarely, by non-homologous recombination, in 
which a recombinant vector that includes the exogenous ceg sequence inserts the 
exogenous ceg sequence into a random location within the host genome. In a more 
preferred embodiment, the integration event inserts the exogenous ceg sequence into a 
specific target site within the host genome. The targeted integration event can involve 
homologous recombination in which the integration vector, that includes the exogenous 
ceg sequence, inserts the exogenous ceg sequence into its homologous target ceg 
sequence that resides within the host's genome (e.g., the endogenous ceg sequence) 
(Figure 1). Further, the exogenous ceg sequence can be used as a disrupting sequence 
whereby the homologous recombination event integrates the exogenous ceg sequence 
into the endogenous target ceg sequence resulting in disruption of the function of the 
endogenous ceg sequence. For example, disrupting the function of the endogenous ceg 
sequence may result in the loss of bacterial cell viability. 

An example of a recombinant vector that can be used as an integration vector in S. 
pneumoniae is the pEVP-3 vector (Jean-Pierre Claverys, et al. 1995 Gene 164:123-128). 
The pEVP-3 vector integrates an exogenous sequence by homologous recombination 
involving a Campbell-type event (S. Adhya and A. Campbell 1970 J. Mol. Biol. 50:481-490). 
The pEVP-3 vector includes a replicon that functions only in gram-negative 
bacteria, such as E. coli. Therefore, the pEVP-3 vector cannot replicate in S. 
pneumoniae. This vector also contains multiple cloning sites, and confers resistance to 
chloramphenicol in both gram-negative and gram-positive bacteria, such as S. 
pneumoniae. 
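
By way of non-limiting illustration, the outcome of a Campbell-type (single crossover) integration may be modeled on character strings as sketched below. The sketch assumes a circular, non-replicating vector that carries an internal fragment of the target gene; after integration the fragment is duplicated and the vector backbone separates the two incomplete copies of the gene. All sequences, the bracketed backbone label, and the function name are hypothetical and serve only to show the resulting arrangement.

```python
def campbell_integrate(chromosome: str, fragment: str, backbone: str) -> str:
    """Model a single crossover between a chromosome and a circular vector
    (fragment + backbone) at the first chromosomal copy of the fragment."""
    start = chromosome.find(fragment)
    if start == -1:
        raise ValueError("no homologous target for the fragment in the chromosome")
    end = start + len(fragment)
    # Result: upstream + fragment + backbone + fragment + downstream
    return chromosome[:end] + backbone + fragment + chromosome[end:]


if __name__ == "__main__":
    gene = "ATGAAACCCGGGTTTTAA"      # hypothetical target gene
    fragment = "CCCGGG"              # internal fragment cloned into the vector
    backbone = "[cat-backbone]"      # stands in for the selectable marker and replicon
    chromosome = "TTGACA" + gene + "CTCTCT"

    integrated = campbell_integrate(chromosome, fragment, backbone)
    print(integrated)
    print("intact gene present:", gene in integrated)
```

In this model neither copy of the gene is complete after integration: the upstream copy lacks its 3' end and the downstream copy lacks its 5' end, which is consistent with the disruption of function described above.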

c) Fusion Gene Sequences 

A fusion ceg gene is another example of a recombinant molecule of the invention. A fusion 
gene includes a ceg sequence operatively fused (e.g., linked) to a non-ceg sequence such as, 
for example, a tag sequence to facilitate isolation and/or purification of the expressed 
CEG gene product (Kroll, D.J., et al., 1993 DNA Cell Biol 12:441-53). 

Alternatively, a recombinant fusion molecule has a ceg sequence of the invention fused to 
a ceg sequence isolated from a different microbial source. For example, the disclosed ceg 
sequences isolated from S. pneumoniae can be fused to a ceg sequence isolated from a 
different bacterial species. 

3.) CEG PROTEINS AND POLYPEPTIDE MOLECULES 

The invention additionally provides CEG proteins and peptide fragments thereof that are 
isolated or substantially purified. Embodiments of particular CEG amino acid sequences 
are disclosed in Tables I and II (SEQ ID NOS:114-226 and SEQ ID NOS:332-436, 
respectively). 

The present invention also includes polypeptides having sequence variations from the 
predicted CEG polypeptide sequences disclosed herein, including mutant variants, 
conservative substitution variants, and similar CEG polypeptides from other prokaryotic 
organisms. For convenience, such proteins are referred to herein as "CEG proteins", 
"CEG polypeptides", or "proteins of the invention". 

As used herein, CEG protein refers to a polypeptide having amino acid sequence identity or 
similarity to any one of the predicted amino acid sequences, as provided in SEQ ID NO.: 
114-226 or 332-436. The variant CEG polypeptides can be allelic forms of CEG, such as 
mutant forms of CEG polypeptides. The present invention also provides conservative 
substitution-mutants of the CEG proteins that maintain functional activity of wild-type CEG 
(e.g., the CEG polypeptide is required for bacterial cell viability). 

The CEG protein may be isolated from any source whether natural, synthetic, semi- 
synthetic, or recombinant. As used herein, "natural" refers to a polypeptide which is 
found in nature. Accordingly, the CEG proteins may be isolated from a prokaryotic 
organism, such as a bacterial strain including, but not limited to, Streptococcus, 
Escherichia, Bacillus, Pseudomonas, Yersinia, Salmonella, and Streptomyces. The CEG 
proteins of the invention, and fragments thereof, can also be generated by recombinant 
methods or chemical synthesis methods. 

The CEG polypeptides of the invention are essential for the viability of a bacterial cell. 
Further, the CEG polypeptides can exhibit at least one of the following functions: a 
pantothenate kinase, a Holliday Junction branch migration protein, a single stranded 
DNA binding protein, a phosphoglucosamine mutase, an acetyltransferase, a 
uridylyltransferase, a malonyl Coenzyme A:ACP transacylase, a 3-oxoacyl-ACP synthase 
II, a 3-oxoacyl-ACP reductase, a phosphomethylpyrimidine (HMP-P) kinase, a GTP 
binding protein, an ATP binding protein, or a 4-aminoimidazole carboxylase. Putative 
functions can include, but are not limited to, sugar transferase, teichoic acid biosynthesis, 
ribosome recycling factor, response regulator, nicotinate phosphoribosyltransferase, 



41 



WO 01/49721 



PCT/US00/35604 



nitropropane dioxygenase, (3SL)-hydroxymyristol acyl carrier protein dehydrase, sugar 
dehydrogenase, mxirein- ^biosynthesis, cobalimin biosynthesis, ABC transporter, tRNA 
modification enzyme, arylsulfatase, 16S processing enzyme, tRNA methyl transferase, 
elongation factor P, signal recognition particle, protein export, undecaprenol kinase, SRP 
docking domain, diacyl glycerol kinase, dihydopicilinate reductase, HU-DNA binding 
protein, thiamine biosynthase, GreA transcription elongation factor, dTDP-L-rhamnose 
synthase, ATP-binding motif, ribose-5-p-3-epimerase-like activity, GTP 
pyrophosphokinase^, acetyl-Cb A carboxylase, O-sialoglycoprotein endopeptidase, 
glucosamine-fhictose-6-phosphase aminotransferase, Strpn adhesion-associated ABC- 
permease, GTP pyrophosphokinase RelA, IMP dehydrogenase, DNA gyrase subunit B, 
acetyl-CoA carboxylase subunit AccD, phosphoglycerol kinase, acetyl-CoA carboxylase 
carbonyl transferase,, phosphopanthetheine adenylyltransferase, oligopeptide transport 
permease subunit, translocation protein, perM permease, DNA pol III gamma and tau 
subunits, DNA pol in delta subunit, signal peptidase I, acetyl-coA carboxylase biotin 
carboxyl carrier protein, protein chain release factor- 1, replicative DNA helicase, 
topoisomerase, pentapeptide-transferase, elongation factor G, spore coat polysaccharide 
biosynthesis protein C, protein release factor B, DNA polymerase HI alpha subunit, 
phosphoprotein phosphatase, chaparonin, UDP-N-acetylmuramoylalanyl-D-glutamate-2, 
6-diaminopimelate ligase, techuronic acid biosynthesis, UDP-glucose lipid carrier 
transferase, transcription termination factor, chromosome segregation factor, amino acid 
biosynthesis, HMG-CoA reductase, hypoxanthine-guanine phosphoribosyltransferase. 

A) MODULATORS OF CEG POLYPEPTIDES

The invention provides compounds that modulate (e.g., activate or inhibit) the function of
a CEG polypeptide. Such compounds can provide lead compounds for developing drugs
for diagnosing and/or treating conditions associated with bacterial infections. The
modulator is a compound that may alter the function of the CEG polypeptide, for example
by activating or inhibiting that function. The compound
can act as an agonist, antagonist, partial agonist, partial antagonist, cytotoxic agent,
inhibitor of cell proliferation, or cell proliferation-promoting agent. The activity of
the compound may be known, unknown or partially known.

Suitable ligands include, but are not limited to, diazalactones, N-protected amino acids,
azabicyclodienes, and alkaloids.

An example of a diazalactone is: [chemical structure not reproduced]

An example of an N-protected amino acid is: [chemical structure not reproduced]

An example of an azabicyclodiene is: [chemical structure not reproduced]

B) METHODS FOR MAKING THE CEG PROTEINS AND POLYPEPTIDES

Recombinant methods are preferred if a high yield is desired. Recombinant methods
involve expressing the cloned gene in a suitable host cell. For example, an expression
vector having the CEG sequence is introduced into a host cell, and the host cell is then
cultured under conditions that permit in vivo production of the CEG protein. The
recombinant vector can integrate the CEG sequence into the host genome. Alternatively,
the CEG sequence can be maintained extra-chromosomally, as part of an autonomously
replicating vector.

1. HOST-VECTOR SYSTEMS

The invention further provides a host-vector system comprising a vector, plasmid,
phagemid, or cosmid comprising a ceg nucleotide sequence, or a fragment thereof,
introduced into a suitable host cell. The host-vector system can be used to produce the
CEG polypeptides encoded by the ceg nucleotide sequences. The host cell can be
prokaryotic or eukaryotic. Examples of suitable prokaryotic host cells include bacterial
strains from genera such as Escherichia, Bacillus, Pseudomonas, Streptococcus, and
Streptomyces. Examples of suitable eukaryotic host cells include a yeast cell, a plant cell,
or an animal cell, such as a mammalian cell. A preferred embodiment provides a
host-vector system comprising the pET21 vector having a ceg sequence introduced into an E.
coli λDE3 lysogen, which is useful, for example, for the production of the CEG protein,
herein designated CFE polypeptides and CFE proteins.

Introduction of the rDNA molecules of the present invention into an appropriate cell host is
accomplished by well known methods that typically depend on the type of vector used and
host system employed. For example, for transformation of prokaryotic host cells,
electroporation and salt treatment methods are typically employed; see, for example, Cohen
et al., 1972 Proc Natl Acad Sci USA 69:2110; Maniatis, T., et al., 1989 Molecular Cloning, A
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY.
For transformation of vertebrate cells with vectors containing rDNAs, electroporation, cationic
lipid or salt treatment methods are typically employed; see, for example, Graham et al.,
1973 Virol 52:456; Wigler et al., 1979 Proc Natl Acad Sci USA 76:1373-76.

Successfully transformed cells, i.e., cells that contain an rDNA molecule of the present
invention, can be identified by well known techniques. For example, cells resulting from
the introduction of an rDNA of the present invention can be selected and cloned to produce
single colonies. Cells from those colonies can be harvested and lysed, and their DNA content
examined for the presence of the rDNA using a method such as that described by Southern,
J Mol Biol (1975) 98:503, or Berent et al., Biotech (1985) 3:208, or the proteins produced
from the cell can be assayed via a biochemical assay or immunological method.

Prokaryotes are generally used as host cells for cloning and producing the products of
exogenous DNA sequences. For example, Escherichia coli K12 BL21 (λDE3)
(Novagen) is particularly useful for expression of foreign proteins. Other strains of E.
coli, bacilli such as Bacillus subtilis, Enterobacteriaceae such as Salmonella
typhimurium or Serratia marcescens, and various Pseudomonas, Streptococcus, and
Streptomyces species may also be employed as host cells in cloning and expressing the
recombinant proteins of this invention.

In general terms, the production of recombinant CEG proteins may involve using a 
host/vector system, or other methods may be used. The host/vector system may employ the 
following steps. 

A nucleic acid molecule is obtained that encodes a CEG protein or a fragment thereof, such 
as any one of the polynucleotides disclosed in SEQ ID NOs.: 1-113 or 227-331. The
CEG-encoding nucleic acid molecule is preferably inserted into an expression vector in operable
linkage with suitable expression control sequences, to generate an expression vector 
including the CEG-encoding sequence. The expression vector is introduced into a suitable 
host, by standard transformation methods, and the resulting transformed host is cultured 
under conditions that allow the production of the CEG protein. For example, if expression 
of the CEG gene is under the control of an inducible promoter, then suitable growth 
conditions would include the appropriate inducer. The CEG protein (e.g., designated a 
CFE polypeptide or protein), so produced, is isolated from the growth medium or directly 
from the cells; recovery and purification of the protein may not be necessary in some 
instances where some impurities may be tolerated. A skilled artisan can readily adapt an 
appropriate host/expression system known in the art for use with CEG-encoding sequences 
to produce a CEG protein (Cohen et al., supra; Maniatis et al., supra).
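
Before a CEG-encoding sequence is placed in operable linkage with expression control sequences, it can be useful to confirm that the insert is an intact open reading frame. The following Python sketch is offered only as an illustration of such a check and is not a procedure described in this application; the function name check_coding_sequence is hypothetical, and the sketch assumes an ATG start codon even though bacterial genes may also begin with alternative start codons such as GTG or TTG.

```python
# Standard stop codons in the DNA alphabet.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_coding_sequence(dna):
    """Return a list of problems found in a candidate coding sequence.

    An empty list means the sequence starts with ATG, has a length that is
    a multiple of three, ends with a stop codon, and has no internal stop.
    Assumption: ATG is the only accepted start codon in this sketch.
    """
    seq = "".join(dna.split()).upper()  # drop whitespace and line breaks
    problems = []
    if set(seq) - set("ACGT"):
        problems.append("non-ACGT characters")
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of 3")
    if not seq.startswith("ATG"):
        problems.append("does not begin with ATG")
    # Split into complete codons, ignoring any trailing partial codon.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("no terminal stop codon")
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        problems.append("internal stop codon")
    return problems
```

When the vector supplies its own stop codon or a C-terminal tag, the terminal-stop check would need to be relaxed accordingly.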

Host cells harboring the nucleic acids disclosed herein are also provided by the present 
invention. A preferred host is E. coli strain BL21(λDE3) transfected or transformed with
a vector comprising a nucleic acid of the present invention. The invention also provides a 
host cell capable of expressing the ceg sequences described herein. The preferred host 
cell is any strain of E. coli that can accommodate high level expression of an exogenously 
introduced gene.

The proteins of the present invention can also be made by chemical synthesis. The
principles of solid phase chemical synthesis of polypeptides are well known in the art and
may be found in general texts relating to this area (Dugas, H. and Penney, C. 1981
Bioorganic Chemistry, pp 54-92, Springer-Verlag, New York). CEG polypeptides may
be synthesized by solid-phase methodology utilizing an Applied Biosystems 430A 
peptide synthesizer (Applied Biosystems, Foster City, Calif.) and synthesis cycles 
supplied by Applied Biosystems. Protected amino acids, such as t-butoxycarbonyl- 
protected amino acids, and other reagents are commercially available from many 
chemical supply houses. 

The polypeptides of the invention exhibit properties of a CEG protein, such as, for 
example, the ability to elicit the generation of antibodies that specifically bind an epitope 
associated with CEG polypeptides. Accordingly, the CEG polypeptide, or any 
oligopeptide thereof, is capable of inducing a specific immune response in appropriate 
animals or cells and binding with specific antibodies. 

C) ANTIBODIES THAT RECOGNIZE AND BIND THE PROTEINS AND
POLYPEPTIDES OF THE INVENTION 

The invention further provides antibodies (e.g., polyclonal, monoclonal, chimeric, 
humanized, and human antibodies) that bind a CEG polypeptide. The most preferred 
antibodies will selectively bind a CEG polypeptide and will not bind (or will bind weakly) a 
non-CEG polypeptide. Antibodies that are particularly contemplated include monoclonal 
and polyclonal antibodies, as well as fragments thereof (e.g., recombinant proteins) which 
include the antigen binding domain and/or one or more complementarity determining regions of
these antibodies. These antibodies can be from any source, for example, rabbit, sheep, rat, 
dog, cat, pig, horse, mouse, and human. 

The invention encompasses antibody fragments that specifically recognize a CEG 
polypeptide. As used herein, an antibody fragment is defined as at least a portion of the 
variable region of the immunoglobulin molecule that binds to its target, i.e., the antigen 
binding region. Some of the constant region of the immunoglobulin may be included.

As will be understood by those skilled in the art, the regions or epitopes of a CEG
polypeptide to which an antibody is directed may vary with the intended application. For
example, antibodies intended for use in an immunoassay for the detection of
membrane-bound CEG proteins on viable bacterial cells should be directed to an accessible epitope
on membrane-bound CEG proteins. Antibodies that recognize other epitopes may be
useful for the identification of CEG protein within damaged or dying cells, or for the
detection of secreted CEG protein or fragments thereof.

Various methods for the preparation of antibodies are well known in the art. For example,
antibodies may be prepared by immunizing a suitable mammalian host using a CEG protein,
peptide, or fragment, in isolated or immunoconjugated form (Harlow, 1989 Antibodies, Cold
Spring Harbor Press, NY). In addition, fusion proteins comprising CEG polypeptides may
also be used, such as a CEG protein/GST-fusion protein. Cells expressing or overexpressing
a CEG polypeptide may also be used for immunizations. Similarly, any cell engineered to
express CEG protein may be used. This strategy may result in the production of monoclonal
antibodies with enhanced capacities for recognizing endogenous CEG protein.

The present invention contemplates chimeric antibodies that comprise a human and
non-human immunoglobulin portion. The antigen combining region (variable region) of a
chimeric antibody can be derived from a prokaryotic source (e.g., bacteria) and the
constant region of the chimeric antibody, which confers biological effector function to the
immunoglobulin, can be derived from a eukaryotic source (e.g., human). The chimeric
antibody should have the antigen binding specificity of the prokaryotic antibody
molecule and the effector function conferred by the eukaryotic antibody molecule.

In one example, the procedure used to produce chimeric antibodies can involve the 
following steps: 

a) Identifying and cloning the correct immunoglobulin gene segment encoding the
antigen binding portion of the antibody molecule. This gene segment is known as
the VDJ (variable, diversity and joining) region for heavy chains, the VJ (variable and
joining) region for light chains, or simply as the V or variable region. These gene
regions may be in either the cDNA or genomic form;

b) Cloning the gene segments encoding the constant region or desired part thereof;

c) Ligating the variable region with the constant region so that the complete chimeric
antibody is encoded in a form that can be transcribed and translated;

d) Ligating this construct into a vector containing a selectable marker and gene control
regions such as promoters, enhancers and poly(A) addition signals;

e) Amplifying this construct in bacteria;

f) Introducing this DNA into eukaryotic cells (transfection), most often mammalian
lymphocytes;

g) Selecting for cells expressing the selectable marker;

h) Screening for cells expressing the desired chimeric antibody; and

i) Testing the antibody for appropriate binding specificity and effector functions.

Chimeric antibodies of several distinct antigen binding specificities have been produced
by protocols well known in the art, including anti-TNP antibodies (Boulianne et al., 1984
Nature 312:643) and anti-tumor antigen antibodies (Sahagan et al., 1986 J. Immunol
137:1066). Likewise, several different effector functions have been achieved by linking
new sequences to those encoding the antigen binding region. Examples of these include
enzymes (Neuberger et al., 1984 Nature 312:604); immunoglobulin constant regions
from another species and constant regions of another immunoglobulin chain (Sharon et
al., 1984 Nature 309:364; Tan et al., 1985 J. Immunol 135:3565-3567). Additionally,
procedures for modifying antibody molecules and for producing chimeric antibody
molecules using homologous recombination to target gene modification have been
described (Fell et al., 1989 Proc Natl Acad Sci USA 86:8507-8511).

The predicted amino acid sequence of a CEG protein may be used to select specific regions
of the CEG protein for generating antibodies. For example, hydrophobicity and
hydrophilicity analyses of a CEG polypeptide may be used to identify hydrophobic and
hydrophilic regions in the CEG protein. Regions of the CEG protein that show
immunogenic structure, as well as other regions and domains, can readily be identified using
various other methods known in the art, such as Chou-Fasman, Garnier-Robson,
Kyte-Doolittle, Eisenberg, Karplus-Schultz or Jameson-Wolf analysis. Fragments that include the
immunogenic regions are particularly suited for generating specific classes of antibodies.
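
As an illustration of the hydrophobicity and hydrophilicity analyses mentioned above, the following Python sketch computes a sliding-window Kyte-Doolittle hydropathy profile and lists windows whose mean hydropathy falls below a cutoff. The window size of 7, the cutoff of -1.0, and the function names are assumptions chosen for the example, not parameters given in this application; strongly hydrophilic windows are only a rough first indicator of surface-exposed, potentially immunogenic regions.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=7):
    """Mean Kyte-Doolittle hydropathy for every full window of the sequence.

    Raises KeyError for residues outside the 20 standard amino acids.
    """
    sequence = sequence.upper()
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[residue] for residue in sequence]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(sequence) - window + 1)
    ]


def hydrophilic_windows(sequence, window=7, threshold=-1.0):
    """Return (start, end) positions, 0-based and end-exclusive, of windows
    whose mean hydropathy is below the threshold (assumed cutoff)."""
    profile = hydropathy_profile(sequence, window)
    return [(i, i + window) for i, value in enumerate(profile) if value < threshold]
```

Candidate regions found this way would still need to be confirmed experimentally, for example by testing whether the corresponding peptides elicit antibodies that bind the full-length protein.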

Methods for preparing a protein for use as an immunogen and for preparing immunogenic
conjugates of a protein with a carrier such as BSA, KLH, or other carrier proteins are well
known in the art. In some circumstances, direct conjugation using, for example,
carbodiimide reagents may be used; in other instances linking reagents such as those
supplied by Pierce Chemical Co., Rockford, IL, may be effective. Administration of a CEG
immunogen is conducted generally by injection over a suitable time period and with use of a
suitable adjuvant, as is generally understood in the art. During the immunization schedule,
titers of antibodies can be taken to determine adequacy of polyclonal antibody formation.

While the polyclonal antisera produced in this way may be satisfactory for some
applications, for pharmaceutical compositions, monoclonal antibody preparations are
preferred. Immortalized cell lines which secrete a desired monoclonal antibody may be
prepared using the standard method of Kohler and Milstein (Nature 256: 495-497) or other
techniques as described in Monoclonal Antibodies: A Manual of Techniques, CRC Press,
Inc., Boca Raton, Fla. (1987) ed. Zola. The immortalized cell lines secreting the desired
antibodies are screened by immunoassay in which the antigen is the CEG polypeptide
having binding activity, or a fragment thereof. When the appropriate immortalized cell
culture secreting the desired antibody is identified, the cells can be cultured either in vitro or
by production in ascites fluid.

The desired monoclonal antibodies are then recovered from the culture supernatant or from
the ascites supernatant. Fragments of the monoclonal antibodies of the invention or the
polyclonal antisera (e.g., Fab, F(ab')2, Fv fragments, fusion proteins) which contain the
immunologically significant portion (i.e., a portion that recognizes and binds a CEG protein)
can be used as antagonists, as well as the intact antibodies. Humanized antibodies directed
against a CEG polypeptide are also useful. The advantage of using humanized antibodies is
that they are less immunogenic in humans. As used herein, a humanized antibody is an
immunoglobulin molecule which is capable of binding to a CEG polypeptide and which
comprises an FR region having substantially the amino acid sequence of a human
immunoglobulin and a CDR having substantially the amino acid sequence of a non-human
immunoglobulin or a sequence engineered to bind a CEG protein. Methods for humanizing
murine and other non-human antibodies by substituting one or more of the non-human
antibody CDRs for corresponding human antibody sequences are well known (Jones et al.,
1986 Nature 321: 522-525; Riechmann et al., 1988 Nature 332: 323-327; Verhoeyen et al.,
1988 Science 239: 1534-1536; Carter et al., 1993 Proc. Natl Acad. Sci. USA 89: 4285;
and Sims et al., 1993 J. Immunol 151: 2296).

Use of immunologically reactive fragments, such as the Fab, Fab', or F(ab')2 fragments, is
often preferable, especially in a therapeutic context, as these fragments are generally less 
immunogenic than the whole immunoglobulin. Further, bi-specific antibodies specific for 
two or more epitopes may be generated using methods generally known in the art. Further, 
antibody effector functions may be modified so as to enhance the therapeutic effect of the 
antibodies of the invention. For example, cysteine residues may be engineered into the Fc 
region, permitting the formation of interchain disulfide bonds and the generation of 
homodimers which may have enhanced capacities for internalization, ADCC and/or 
complement-mediated cell killing (Caron et al., 1992 J. Exp. Med 176: 1191-1195; 
Shopes, 1992 J. Immunol 148: 2918-2922). Homodimeric antibodies may also be 
generated by cross-linking techniques known in the art (Wolff et al., Cancer Res. 53: 2560- 
2565). The invention also provides pharmaceutical compositions having the monoclonal 
antibodies or anti-idiotypic monoclonal antibodies of the invention. 

The antibodies or fragments may also be produced, using current technology, by
recombinant means. Regions that bind specifically to the desired regions of the CEG 
protein can also be produced in the context of chimeric or CDR grafted antibodies of 
multiple species origin. The invention includes an antibody, e.g., a monoclonal antibody 
which competitively inhibits the immunospecific binding of any of the monoclonal 
antibodies of the invention to a CEG protein.

Alternatively, methods for producing fully human monoclonal antibodies, including phage
display and transgenic methods, are known and may be used for the generation of human
monoclonal antibodies (reviewed in: Vaughan et al., 1998 Nature Biotechnology 16: 535- 
539). For example, fully human monoclonal antibodies may be generated using cloning 
technologies employing large human Ig gene combinatorial libraries (i.e., phage display) 
(Griffiths and Hoogenboom, "Building an in vitro immune system: human antibodies from 
phage display libraries", in: Protein Engineering of Antibody Molecules for Prophylactic 
and Therapeutic Applications in Man, Clark, M. (Ed.), Nottingham Academic, pp 45-64 
(1993); Burton and Barbas, "Human Antibodies from combinatorial libraries" Id., pp 65- 
82). Fully human monoclonal antibodies may also be produced using transgenic mice 
engineered to contain human immunoglobulin gene loci as described in PCT Patent 
Application WO 98/24893, Jakobovits et al., published December 3, 1997 (see also,
Jakobovits, 1998 Exp. Opin Invest Drugs 7: 607-614). This method avoids the in vitro 
manipulation required with phage display technology and efficiently produces high affinity, 
authentic human antibodies. 

The antibody or fragment thereof of the invention may be labeled with a detectable 
marker or conjugated to a second molecule, such as a therapeutic agent (e.g., a cytotoxic 
agent) thereby resulting in an immunoconjugate. For example, the therapeutic agent 
includes, but is not limited to, an anti-tumor drug, a toxin, a radioactive agent, a cytokine, 
a second antibody or an enzyme. Further, the invention provides an embodiment wherein 
the antibody of the invention is linked to an enzyme that converts a prodrug into a 
cytotoxic drug. 

Examples of cytotoxic agents include, but are not limited to, ricin, ricin A-chain,
doxorubicin, daunorubicin, taxol, ethidium bromide, mitomycin, etoposide, teniposide,
vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin D, diphtheria
toxin, Pseudomonas exotoxin (PE) A, PE40, abrin, abrin A chain, modeccin A chain,
alpha-sarcin, gelonin, mitogellin, restrictocin, phenomycin, enomycin, curcin, crotin,
calicheamicin, Saponaria officinalis inhibitor, and glucocorticoid and other
chemotherapeutic agents, as well as radioisotopes such as 212Bi, 131I, 131In, 90Y, and 186Re.

Suitable detectable markers for diagnostic use include, but are not limited to, a
radioisotope, a fluorescent compound, a bioluminescent compound, a chemiluminescent
compound, a metal chelator or an enzyme. Antibodies may also be conjugated to an
anti-cancer pro-drug activating enzyme capable of converting the pro-drug to its active form.
See, for example, U.S. Patent Nos. 4,952,394 and 5,716,990.

Additionally, a recombinant protein of the invention comprising the antigen-binding 
region of any of the monoclonal antibodies of the invention can be made. In such a 
situation, the antigen-binding region of the recombinant protein is joined to at least a
functionally active portion of a second protein having therapeutic activity. The second 
protein can include, but is not limited to, an enzyme, lymphokine, oncostatin or toxin. 
Suitable toxins include those described above. 

Techniques for conjugating or joining therapeutic agents to antibodies are well known
(Arnon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer Therapy",
in: Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. (eds.), pp. 243-56, Alan R.
Liss, Inc. 1985; Hellstrom et al., "Antibodies For Drug Delivery", in: Controlled Drug
Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53, Marcel Dekker, Inc. 1987; Thorpe,
"Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A Review", in: Monoclonal
Antibodies '84: Biological And Clinical Applications, Pinchera et al. (eds.), pp. 475-506
(1985); and Thorpe et al., "The Preparation And Cytotoxic Properties Of Antibody-Toxin
Conjugates", in: Immunol Rev., 62:119-58 (1982)). Techniques for joining detectable
markers to antibodies are also known.

D) PHARMACEUTICAL COMPOSITIONS OF THE INVENTION 

The invention includes pharmaceutical compositions for use in the treatment of microbial
infections comprising a pharmaceutically effective amount of an anti-CEG antibody or a
CEG polypeptide.

In one embodiment, the pharmaceutical compositions may comprise a CEG antibody,
either unmodified, conjugated to a therapeutic agent (e.g., drug, toxin, enzyme or second 
antibody) or in a recombinant form (e.g., chimeric or bispecific). The compositions may 
additionally include other antibodies or conjugates (e.g., an antibody cocktail). 

The pharmaceutical compositions also preferably include suitable carriers and adjuvants
which include any material which when combined with the molecule of the invention
(e.g., an anti-CEG antibody or a CEG protein) retains the molecule's activity and is
non-reactive with the subject's immune system. Examples of suitable carriers and adjuvants
include, but are not limited to, human serum albumin, ion exchangers, alumina, lecithin,
buffer substances such as phosphates, glycine, sorbic acid, potassium sorbate, and salts or
electrolytes such as protamine sulfate. Other examples include any of the standard
pharmaceutical carriers such as a phosphate buffered saline solution, water, emulsions
such as oil/water emulsion, and various types of wetting agents. Other carriers may also
include sterile solutions, tablets including coated tablets, and capsules. Typically such
carriers contain excipients such as starch, milk, sugar, certain types of clay, gelatin,
stearic acid or salts thereof, magnesium or calcium stearate, talc, vegetable fats or oils,
gums, glycols, or other known excipients. Such carriers may also include flavor and
color additives or other ingredients. Compositions comprising such carriers are
formulated by well known conventional methods. Such compositions may also be
formulated within various lipid compositions, such as, for example, liposomes as well as
in various polymeric compositions, such as polymer microspheres.

The pharmaceutical compositions of the invention can be administered using
conventional modes of administration including, but not limited to, intravenous,
intraperitoneal, oral, intralymphatic or administration directly into the tumor.
Intravenous administration is preferred.

The pharmaceutical compositions of the invention may be in a variety of dosage forms
which include, but are not limited to, liquid solutions or suspensions, tablets, pills,
powders, suppositories, polymeric microcapsules or microvesicles, liposomes, and
injectable or infusible solutions. The preferred form depends upon the mode of
administration and the therapeutic application.

The CEG polypeptides and proteins of this invention are found in common pathogenic
bacterial species such as Streptococcus pneumoniae. This organism causes upper
respiratory tract infections. Thus, the peptides and proteins of this invention can be used
as immunogens in subunit vaccines for vaccination against a pathogenic bacterium such as
Streptococcus pneumoniae. Additionally, the ceg sequences of the invention can be used
as DNA vaccines (U.S. Patent No. 5,736,524 and U.S. Patent No. 5,989,553).

The polypeptides and proteins of this invention can be formulated as univalent and
multivalent vaccines. The protein can be mixed, conjugated or fused with other antigens, 
including B or T cell epitopes of other antigens. 

Further, when a haptenic peptide of the proteins of the invention is used (i.e., a peptide
which reacts with cognate antibodies, but cannot itself elicit an immune response), it can
be conjugated to an immunogenic carrier molecule. Conjugation to an immunogenic
carrier can render the oligopeptide immunogenic. Examples of carrier molecules are
tetanus toxin or toxoid, diphtheria toxin or toxoid and any mutant forms of these proteins
such as CRM197. Others include exotoxin A of Pseudomonas, the heat labile toxin
of E. coli and rotaviral particles (including rotavirus and VP6 particles). Alternatively, a
fragment or epitope of the carrier protein or other immunogenic protein can be used. For
example, the hapten can be coupled to a T cell epitope of a bacterial toxin.

In formulating the vaccine compositions with the CEG polypeptides or proteins of the
invention, alone or in the various combinations described, the immunogen is adjusted to
an appropriate concentration and formulated with any suitable vaccine adjuvant. Suitable
adjuvants include, but are not limited to: surface active substances, e.g., hexadecylamine,
octadecylamine, octadecyl amino acid esters, lysolecithin,
dimethyldioctadecylammonium bromide, methoxyhexadecylglycerol, and pluronic polyols;
polyamines, e.g., pyran, dextransulfate, poly IC, carbopol; peptides, e.g., muramyl
dipeptide, dimethylglycine, tuftsin; oil emulsions; and mineral gels, e.g., aluminum
hydroxide, aluminum phosphate, etc.; and immune stimulating complexes. The
immunogen may also be incorporated into liposomes, or conjugated to polysaccharides
and/or other polymers.

The vaccines can be administered to a human or animal in a variety of ways. These 
include intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, oral and 
intranasal routes of administration. Further, the vaccines can be live or inactivated 
vaccines. 

The most effective mode of administration and dosage regimen for the compositions of 
this invention depends upon the severity and course of the disease, the patient's health 
and response to treatment and the judgment of the treating physician. Accordingly, the 
dosages of the compositions should be titrated to the individual patient. 

E) USES OF THE MOLECULES OF THE INVENTION 

1) MOLECULAR WEIGHT MARKERS 

The nucleic acid molecules of the invention and their encoded proteins may be employed
as molecular weight markers. For example, the molecular weight of each of the nucleic 
acid molecules having ceg sequences and their predicted polypeptides can be determined 
and can be used to compare against other gene sequences and proteins whose molecular 
weights are unknown. 
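
For illustration, the predicted molecular weight of a polypeptide can be estimated from its amino acid sequence by summing average residue masses and adding one water molecule for the free termini. The Python sketch below is one such estimate under stated assumptions: it covers only the 20 standard amino acids, ignores post-translational modifications, and uses a hypothetical function name that does not appear in this application.

```python
# Average residue masses in daltons (amino acid mass minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the N-terminal H and C-terminal OH


def protein_molecular_weight(sequence):
    """Estimate the average molecular weight (Da) of an unmodified polypeptide.

    Raises KeyError for residues outside the 20 standard amino acids.
    """
    sequence = sequence.upper()
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(AVERAGE_RESIDUE_MASS[residue] for residue in sequence) + WATER_MASS
```

A value computed this way is a prediction only; apparent molecular weight on a gel can differ because of charge, shape, or modification of the protein.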

2) DIAGNOSTICS 

The nucleic acid molecules of the invention may be employed in diagnostic
embodiments. For example, the presence of nucleotide sequences which are identical or
similar to the ceg sequences of the invention may be detected within a biological sample.
The presence of such sequences in the biological sample, which may include blood, serum
or a swab from nose, ear or throat, may be determined by means of a nucleic acid detection assay.

Nucleic acid probes or primers having sequences complementary to ceg sequences may
be used in a hybridization assay to detect the presence of the sequences which are
identical or similar to the ceg sequences of the invention in the biological samples.
Typically, nucleic acid molecules obtained from a suitable biological sample are
hybridized with labeled probes or primers. The resulting hybridized molecules are
detected and resolved by methods well known in the art, such as Northern or Southern
blotting, micro-array technology, or amplifying with PCR technology. Other
hybridization techniques and systems are known that can be used in connection with the
detection aspects of the invention, including diagnostic assays such as those described in
Falkow et al., U.S. Pat. No. 4,358,535.

Examples of the PCR technology are disclosed in U.S. Patent Nos. 4,683,202 and 
4,965,188 (incorporated herein by reference). Generally, nucleic acid molecules are 
obtained from a suitable biological source and contacted with two primers corresponding 
to the ceg sequences disclosed herein, under conditions which allow for hybridization and 
polymerization to occur. A pair of probes, one corresponding to the 5' flanking region 
and the other corresponding to the 3' flanking region, would be sufficient to detect the 
nucleic acid molecules of the invention in a biological sample and may be used to 
indicate the amount of bacteria present. 

Alternative methods of detecting nucleic acid molecules include, for example, in situ 
hybridization techniques, where a ceg probe is used to detect homologous sequences 
within one or more cells, such as cells within a clinical sample or even cells grown in 
tissue culture. As is well known in the art, the cells are prepared for hybridization by 
fixation, e.g. chemical fixation, and placed in conditions that allow for the hybridization 
of a detectable probe with nucleic acids located within the fixed cell. 

The amount of ceg sequences present in a biological sample can be quantified and 
compared to the levels in a normal or "healthy" sample. For example, ceg sequences 
present in either increased or decreased levels, compared to the levels found in the 
control sample, may indicate the presence of bacteria. This information is useful for 
diagnosis of a bacterial infection that requires treatment with an antibacterial agent. 

Alternatively, the amount of CEG polypeptides present in a biological sample may be 
determined by means of an immunoassay. For example, labeled antibodies reactive 
against CEG polypeptides may be used in an immuno-reactive assay to detect the 
presence of CEG polypeptides in the biological samples. 

3) SCREENING CANDIDATE CEG SEQUENCES 

a) Gene Disruption Assay 

The ceg nucleotide sequences of the invention can be used to identify nucleotide 
sequences which are identical or similar to the ceg sequences that are required for 
bacterial cell viability. For example, the ceg sequences can be used in a bacterial gene 
disruption assay to screen candidate nucleotide sequences to identify sequences required 
for bacterial cell viability. 

The disruption assay can involve: introducing into a host cell a recombinant vector that is 
capable of integration into the host genome, where the recombinant vector includes a 
candidate sequence that putatively encodes a cell-viability gene product (e.g., the 
exogenous ceg sequence); the vector integrates the candidate sequence into a target 
sequence within the host's genome (e.g., the endogenous ceg sequence); and the host cell, 
so introduced, is screened for viability. The recombinant vector preferably includes a 
selectable marker so that the introduced host cell can be screened for viability in the 
presence of a selectable agent. 

For example, Figure 1 shows a schematic representation of a gene disruption assay 
within a bacterial host cell. In Figure 1A, the recombinant vector, pEVP3, includes the 
CAT gene (e.g., the selectable marker chloramphenicol acetyl transferase) and an internal 
region of the ceg disrupting sequence; the internal region excludes the 5' and 3' ends of 
the ceg sequence. The "X" in Figure 1 indicates the recombinant pEVP3 vector undergoing 
homologous recombination with the target sequence (e.g., within the host genome). In 
Figure 1B, the resolved pEVP3 vector that is integrated into the host genome is shown. 
Left to right are the following elements: the native promoter of the target gene; a 5' partial 
copy of the target gene; the body of the integrated pEVP3 vector including the disrupting 
gene and CAT; and, a 3' partial copy of the target gene. Thus, integration of the pEVP3 
vector via homologous recombination results in two partial gene duplications flanking the 
integrated vector. If the target gene is not essential for survival, it is possible to recover 
chloramphenicol-resistant colonies of S. pneumoniae. Failure to recover chloramphenicol 
resistant colonies, in the presence of the proper controls as described below, indicates that 
the target gene may be essential for cell viability. 

More particularly, the gene disruption assay for screening candidate ceg sequences can 
involve the following steps. The recombinant pEVP3 vector, encoding chloramphenicol 
resistance (CAT) and having a fragment of a candidate ceg sequence, can be introduced into 
transformation-competent S. pneumoniae cells by methods that are well known in the art 
(Lee, M.S., et al., 1998 Appl Environ Microbiol 64:4796-4802). The preferred size of 
the ceg fragment can be between about 200 and about 500 bp in length. It is advantageous 
that the candidate ceg sequence does not include the 5' and 3' ends that encode the N- 
and C-terminal ends of the CEG polypeptide. This ensures that the inserted ceg fragment 
and the disrupted endogenous ceg gene sequence are not capable of expression of a 
full-length, functional ceg gene product. The transformation-competent cells can be obtained 
by performing the transformation step in the presence of a heptadecapeptide that induces 
competence for transformation of S. pneumoniae (Havarstein, L. S., et al., 1995 Proc. 
Natl Acad Sci. 92:11140-11144), such as the CSP-1 peptide. The CSP-1 can be 
naturally-derived or synthetic. Additionally, the transformation step can be optimized by 
performing the transformation when the cells have reached a density which is optimal for 
transformation (e.g., 3 x 10^7 cells per ml) (Havarstein, L. S. et al., supra). The 
recombinant vector can be introduced into the competent pneumococci and may undergo 
homologous recombination, whereby the candidate ceg fragment recombines with the 
corresponding endogenous ceg sequence, resulting in targeted integration of the vector 
into the pneumococcal genome and disruption of the endogenous ceg. 

The transformed cells can be plated on or cultured in chloramphenicol-containing growth 
medium. The cells can be cultured under standard conditions, such as 37°C in 5% CO2 
for approximately 40 to 48 hours, for the purpose of selecting cells that carry the 
integrated vector. 

Additionally, control samples can be run in parallel with the gene disruption assay, in 
order to determine whether the gene disruption procedure is working properly. For 
example, the control samples can be used to calibrate the gene disruption experiment so 
that disruption of a known non-essential bacterial gene results in an approximate number 
of colonies per plate. Similarly, the disruption of a known essential gene can be 
calibrated to yield only zero or one colony per plate. The appearance of one colony is 
due to the rare illegitimate recombination into a non-homologous sequence. In particular, 
a known non-essential gene such as the lytA gene (Tomasz, A., et al., 1988 J. Bacteriol 
170:5931-5934) can be used so that between about 70 and 100 chloramphenicol-resistant 
colonies will grow per plate. Similarly, the ftsZ gene (Lutkenhaus, J. F., et al., 1980 J. 
Bacteriol 143:1281-1288), a known essential gene, can be used to yield zero or, rarely, 
one colony per plate. As is well known in the art, specific parameters that are involved in 
any given gene disruption assay can be adjusted to calibrate the desired number of plated 
cells in the control samples. Experimental parameters that can be adjusted include, but 
are not limited to, the E. coli strain used to propagate the vector/insert, the fragment 
length of the sequence to be integrated, the amount of recombinant integration vector 
used to transform the cells, use of transformation-competent cells, and plating density of 
the transformed cells. 
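
The control calibration described above can be read as a simple decision rule. The following is a minimal sketch only, not part of the disclosed procedure: the function name `interpret_disruption` and the hard thresholds (70 to 100 colonies for the lytA control, at most one colony for the ftsZ control and for an essential candidate) are illustrative assumptions taken from the approximate ranges quoted in the text.

```python
def interpret_disruption(candidate_colonies, lyta_colonies, ftsz_colonies):
    """Interpret chloramphenicol-resistant colony counts (per plate).

    Hypothetical helper; thresholds are assumptions based on the
    approximate ranges given in the text, not fixed assay parameters.
    """
    # Non-essential control (lytA): roughly 70-100 colonies expected.
    lyta_ok = 70 <= lyta_colonies <= 100
    # Essential control (ftsZ): zero or, rarely, one colony expected.
    ftsz_ok = ftsz_colonies <= 1
    if not (lyta_ok and ftsz_ok):
        return "controls out of range; repeat assay"
    # A candidate behaving like the essential control may be essential.
    if candidate_colonies <= 1:
        return "candidate may be essential"
    return "candidate is non-essential"
```

In practice the acceptable ranges would be set per experiment, since the text notes that vector amount, fragment length, and plating density all shift the expected colony counts.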

The transformed cells carrying the recombinant integration vector that disrupts 
expression of an endogenous essential gene (e.g., the target ceg gene) can be identified, 
based on a selectable phenotype such as non-viability. For example, the cells that carry a 
disrupted non-essential gene will be viable and, due to the integration of pEVP3, will 
grow on chloramphenicol-containing medium. In contrast, cells that carry a disrupted 
essential gene will not grow (i.e., are non-viable) on the chloramphenicol-containing 
medium. Thus, the transformed cells that do not grow under these selective conditions 
carry an endogenous gene sequence that is essential for cell viability which has been 
disrupted by an exogenous candidate fragment, thereby identifying a ceg sequence. Steps 
one through three may be repeated in order to confirm that the ceg sequences, so 
identified, are essential for cell viability. 

b) Autolysin Assay 

It is advantageous to perform additional steps to determine whether the homologous 
recombination events result in disruption of the intended target gene sequence. The lytA 
transformation control can be used to confirm that the transformation system is 
functioning properly. For example, a phenotypic test for autolysin activity (lytA gene 
product) can be performed to determine that the exogenous lytA fragment is correctly 
integrated into the lytA site within the host genome. This typically involves flooding the 
culture plates containing transformants carrying the integrated lytA control vector with a 
solution of detergent, such as 0.1% deoxycholate, which triggers cell lysis in lytA-intact 
cells (e.g., the cells that have not undergone homologous recombination). After about 5 to 
10 minutes the colonies with intact lytA will appear ghost-like due to cell lysis, and the 
colonies with a disrupted lytA gene will appear intact. 

c) Polarity Analysis 

The ceg sequences that are confirmed to be essential for cell viability can be examined 
further by performing a polarity analysis to determine if the corresponding endogenous 
ceg sequence is organized in an operon. Polarity is an effect unique to prokaryotes and is 
the result of the operon organization of bacterial genomes. Many bacterial genes are 
arranged in operons in which multiple genes are under the control of a single regulatory 
sequence (e.g., a promoter) and are transcribed into a single mRNA transcript. With 
respect to the orientation of multiple genes within an operon, the genes that are proximal 
to the regulatory sequence are said to be "upstream" genes and the genes that are distal 
are said to be "downstream" genes. For example, many operons contain genes encoding 
different proteins that catalyze discrete steps of a common biochemical pathway. Thus, 
any of the proteins that catalyze the steps of the pathway may be essential for cell 
viability. 

The presence of operons in a bacterial host genome may influence the interpretations of 
the gene disruption results. For example, a loss of viability following disruption of an 
upstream gene may be erroneously attributed to loss of expression of the disrupted gene but 
may, in fact, result from effects on expression of the intact downstream genes. Therefore, it is advantageous to 
perform a polarity analysis to determine if a ceg sequence is part of an operon. 

A polarity analysis can involve performing an in vivo gene disruption procedure using, as 
the disrupting sequence, a ceg sequence that includes the entire ceg coding sequence 
region but lacks expression regulatory sequences. This differs from the gene disruption 
assay, which involves the central region of the ceg sequence. The polarity analysis 
involves gene duplication via homologous recombination. For example, the pEVP3 
vector having the entire coding region of a ceg sequence can be used for the polarity 
analysis (Figure 2A). The polarity analysis will yield different results depending on the 
organization of the endogenous target sequence within the host genome. 

For example, Figure 2 shows a schematic representation of the polarity test for operons, 
within a bacterial host cell. In Figure 2A, the recombinant vector, pEVP3, includes the 
CAT gene and the entire coding region of the ceg disrupting sequence. The "X" in Figure 
2 indicates the recombinant pEVP3 vector undergoing homologous recombination with the 
target sequence. Two of the possible results of homologous recombination are shown in 
Figures 2B and 2C. 

In Figure 2B, case 1, if the endogenous target sequence is not organized in an operon, the 
integration event may yield: a functional target sequence (e.g., it is capable of 
expression); a duplicate non-functional target sequence that lacks a promoter; and a 
functional downstream gene (e.g., Gene B) that is controlled by its own promoter. The 
cells carrying this type of integrated target sequence can be recovered as viable cells that 
grow in the presence of chloramphenicol; this condition is termed "polarity negative". 

In Figure 2C, case 2, if the target sequence is organized in an operon, then the integration 
event may yield an integration site that is similar to that described for case 1, including: a 
functional target sequence; and a duplicate, non-functional target sequence. 
However, this integration event may also yield a non-functional downstream 
gene (e.g., Gene B) because expression of this downstream gene is controlled by a 
promoter located upstream of the insertion site. The cells that carry this type of 
integrated target sequence will be non-viable; this condition is termed "polarity positive". 
Thus, the polarity analysis provides a method to determine whether integration of a 
recombinant vector into a target ceg sequence affects expression of downstream genes. 
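
The two outcomes of the polarity test map directly onto a plus/minus call. As a minimal sketch, assuming a hypothetical helper `polarity_call` and a three-valued input (viable, non-viable, or not yet tested), the classification could be recorded as follows; the symbols follow the "+", "-", and blank convention described for Table I.

```python
def polarity_call(viable):
    """Map a polarity-test outcome to a "+", "-", or blank symbol.

    viable: True if chloramphenicol-resistant cells were recovered after
            integration of the full coding region (case 1),
            False if no viable cells were recovered (case 2),
            None if the test has not been performed.
    Hypothetical helper for illustration only.
    """
    if viable is None:
        return ""          # not yet classified
    # Viable cells: downstream expression unaffected -> polarity negative.
    # No viable cells: downstream expression lost -> polarity positive.
    return "-" if viable else "+"
```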

The ceg sequences disclosed herein (SEQ ID NOs.: 1-113, 227-331) encode gene 
products that are essential for viability in S. pneumoniae. Furthermore, many of these 
ceg sequences have been analyzed for the polarity effect and the results are presented in 
Table I. One subset of ceg sequences is classified as polarity negative (-), since the 
homologous recombination event did not affect the expression of downstream genes. 
Another subset of ceg sequences is classified as polarity positive (+), since the 
homologous recombination event did affect the expression of downstream genes. The 
ceg sequences that have not yet been classified as polarity positive or negative are 
indicated in Table I as a blank. For the ceg sequences that are classified as polarity 
positive, the genes downstream of the disrupted endogenous ceg sequences may or may 
not also be essential. 

4) ASSAYS FOR IDENTIFYING CEG LIGANDS AND OTHER 
BINDING AGENTS 

The present invention provides screening methods for identifying agents that interact 
with and/or bind to the CEG proteins of the invention, such as a ligand. An agent can be, for 
example, a natural product, a derived or synthetic chemical molecule, a polypeptide, a 
nucleic acid molecule, or a metal. The agents that interact with CEG proteins may cause 
bacterial cell death by disrupting the functions of CEG proteins, including, but not 
limited to, nucleotide biosynthesis, DNA replication, RNA transcription, protein 
translation, and/or cell wall biosynthesis. Accordingly, the present invention provides 
screening methods for identifying agents having antibacterial activity, such as agents that 
cause bacterial cell death by interacting with the CEG proteins. These antibacterial 
agents are useful for treating diseases and afflictions associated with bacterial infections. 

Various methods can be used to discover agents having antibacterial activity, as 
determined by the ability of the binding agent to bind to a CEG protein and disrupt the 
function of the CEG protein. These screening methods include whole cell in vivo assays 
as well as in vitro assays with cellular components. 

An in vivo screening method for identifying ligands that bind CEG polypeptides can be 
performed in a whole cell assay. A typical method may be the use of whole bacterial 
cells to assess the antibacterial properties based on cell growth or viability. These 
methods can include methods for measuring cell growth and/or viability, for example, by 
optical density or zones of growth (Koch, A. L. et al., 1970 Anal. Biochem. 38:252-259; 
Biemer, J. J. et al., 1973 Ann. Clin. Lab. Sci. 2:135-140; Manual of Clinical 
Microbiology, 7th edition, Murray, P. R. (ed), ASM Press), by growth inhibition in an 
agar assay (Murray, P. R., supra), or other means of detecting cell metabolism 
(Mychajlonka, M. et al., 1980 Antimicrob. Agents Chemother. 17:572-582), and are well 
known to those skilled in the art. In addition, there are molecular biology-based detection 
methods for use with whole bacterial cells, such as gene reporter assays, to monitor the 
effect of the ligand on specific targets (Slauch, J. M., et al., 1991 Methods Enzymol. 
204:213-248). Examples of the reporter genes include, but are not limited to, 
beta-galactosidase, alkaline phosphatase, luciferase, and green fluorescent protein. For 
example, one embodiment provides a reporter system that monitors inhibition of DNA 
synthesis by fusing a reporter such as beta-galactosidase (lacZ) to genes known to be 
upregulated by the cessation of DNA synthesis as a result of the binding of ligands to the 
DNA synthetic apparatus (Shurvinton, C. E., et al., 1982 Mol Gen. Genetics 185:352-355; 
Rosato, A., et al., 1998 Antimicrob. Agents Chemother. 42:1392-1396). 

Alternatively, the yeast two-hybrid system (Fields, S. and Song, O. 1989, Nature 
340:245-246) may be adapted to screen for ligands that bind CEG polypeptides. Generally, 
the yeast two-hybrid system is performed in a yeast host cell carrying a reporter gene, and 
is based on the modular nature of the GAL transcription factor which has a DNA binding 
domain and a transcriptional activation domain. The yeast two-hybrid system relies on 
the physical interaction between a recombinant polypeptide that comprises the GAL 
DNA binding domain and another recombinant polypeptide that comprises the GAL 
transcriptional activation domain. The physical interaction between the two recombinant 
polypeptides reconstitutes the transcriptional activity of the transcription factor, thereby 
causing expression of the reporter gene. Either of the recombinant polypeptides used in 
the two-hybrid system can be generated to include a CEG polypeptide sequence to screen 
for binding partners of CEG. 

Another method uses the bacterial CEG proteins as the basis for in vitro assay systems to 
detect binding agents. Typically, the in vitro screening method comprises: a) generating 
the CEG protein of the invention, or membranes enriched in the CEG protein; b) 
exposing the CEG protein or membranes to a candidate agent; and c) detecting the 
interaction of the CEG protein with the agent by any suitable means. Additionally, the 
screening methods may be adapted to automated high-throughput procedures, such as 
PANDEX® (Baxter-Dade Diagnostics), allowing for efficient high-volume screening 
of candidate agents. 

An alternative method for screening potential ligands involves an in vitro binding 
procedure. Typically, the CEG proteins of the invention can be produced using 
recombinant DNA technology and host-vector systems as described herein. A candidate 
agent is introduced into a reaction vessel containing the CEG protein, or fragment 
thereof; the candidate agents may be detectable by methods such as, but not limited to, 
radioisotope or chemical labeling. Binding of the CEG protein by a candidate agent can 
be determined by any suitable means, including, for example, quantifying bound label 
versus unbound label using any suitable method. Binding of a candidate agent may also 
be detected by methods similar to an alternative physical method disclosed in U.S. Patent 
No. 5,585,277. In this method, binding of a candidate agent to a protein is assessed by 
monitoring the ratio of folded protein to unfolded protein, for example by monitoring 
sensitivity of the protein to a protease, or amenability to binding of the protein by a 
specific antibody against the folded state of the protein, or binding to chaperone protein, 
or by binding to any suitable surface. 

The invention provides methods of identifying compounds that modulate (e.g., activate or 
inhibit) the function of a CEG polypeptide. Essentially any compound can be used in the 
assays of the invention. The preferred compounds are those that are soluble in aqueous 
or organic solutions. It will be appreciated by those of skill in the art that there are many 
commercial suppliers of chemical compounds that can be used in the methods of the 
invention, including Sigma Chemical Co. (St. Louis, Mo.), Aldrich Chemical Co. (St. 
Louis, Mo.), Sigma-Aldrich (St. Louis, Mo.), Fluka Chemika-Biochemica Analytika 
(Buchs, Switzerland), and the like. 

The present invention provides methods for detecting compounds which are identified as 
modulators of CEG function. The methods of the invention can be performed using 
isolated CEG polypeptides, or using whole cells expressing the CEG polypeptide. The 
steps of the method using isolated CEG polypeptides include: contacting the isolated 
CEG polypeptide with a candidate compound; and determining whether the function of 
the CEG polypeptide is altered. The steps of the method using whole cells include: 
contacting the whole cells with a candidate compound; and determining whether the cell 
dies, indicating the compound inhibited the function of a CEG polypeptide. 

The preferred methods of the invention provide high-throughput screening assays for 
identifying compounds which modulate the function of a CEG polypeptide. The high 
throughput methods permit screening of large libraries of compounds. For example, the 
high throughput methods can use automated assay steps. The assays can be performed 
in parallel on a solid support, as microtiter formats on microtiter plates in robotic assays 
are well known. A preferred embodiment of the methods includes adapting the methods 
to use microtiter plates or pico-, nano-, or micro-liter arrays. In high throughput assays it 
is desirable to run positive controls to ensure that the components of the assays are 
working properly. 

The high throughput screening methods of the invention include providing a 
combinatorial library containing a large number of compounds (candidate modulator 
compounds) (Borman, S., C. & E. News, 1999, 70(10), 33-48). Such combinatorial 
chemical libraries can be screened in one or more assays to identify library members 
(particular chemical species or subclasses) that exhibit the ability to modulate the 
function of the CEG polypeptide (Borman, S., supra; Dagani, R., C. & E. News, 1999, 
70(10), 51-60). The compounds, so identified, can serve as lead-compounds or can 
themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by using either chemical synthesis or biological synthesis, to combine a 
number of chemical building blocks, such as reagents. For example, a linear 
combinatorial chemical library, such as a polypeptide library, is formed by combining a 
set of chemical building blocks (amino acids) in every possible way for a given 
compound length (i.e., the number of amino acids in a polypeptide compound). Millions 
of chemical compounds can be synthesized through such combinatorial mixing of 
chemical building blocks. 
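
To make the scale of "every possible way" concrete, the number of linear compounds of a given length is simply the number of building blocks raised to that length. The sketch below is illustrative only; the names `library_size` and `enumerate_library`, and the choice of the 20 standard amino acids as the building-block set, are assumptions rather than anything specified in the text.

```python
from itertools import product

# Assumed building-block set: one-letter codes of the 20 standard amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(length, n_blocks=len(AMINO_ACIDS)):
    """Number of distinct linear compounds of the given length."""
    return n_blocks ** length


def enumerate_library(length, blocks=AMINO_ACIDS):
    """Lazily yield every linear sequence of the given length."""
    for combo in product(blocks, repeat=length):
        yield "".join(combo)
```

Under these assumptions a pentapeptide library already contains 20^5 = 3,200,000 members, which is consistent with the "millions of chemical compounds" mentioned above. The generator form avoids holding the whole library in memory.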

Preparation and screening of combinatorial chemical libraries is well known to those of skill in 
the art. Such combinatorial chemical libraries include, but are not limited to, peptide libraries 
(see, e.g., U.S. Pat. No. 5,010,175, Furka, Int. J. Pept. Prot. Res., 1991, 37:487-493 and 
Houghton, et al., Nature, 1991, 354, 84-88). Other chemistries for generating chemical 
diversity libraries can also be used. Such chemistries include, but are not limited to, peptoids 
(PCT Publication No. WO 91/19735); encoded peptides (PCT Publication WO 93/20242); 
random bio-oligomers (PCT Publication No. WO 92/00091); benzodiazepines (U.S. Pat. No. 
5,288,514); diversomers, such as hydantoins, benzodiazepines and dipeptides (Hobbs, et al., 
Proc. Nat. Acad. Sci. USA, 1993, 90, 6909-6913); vinylogous polypeptides (Hagihara, et al., J. 
Amer. Chem. Soc., 1992, 114, 6568); nonpeptidal peptidomimetics with beta-D-glucose 
scaffolding (Hirschmann, et al., J. Amer. Chem. Soc., 1992, 114, 9217-9218); analogous 
organic syntheses of small compound libraries (Chen, et al., J. Amer. Chem. Soc., 1994, 116, 
2661; Armstrong, et al., Acc. Chem. Res., 1996, 29, 123-131); or small organic molecule 
libraries (see, e.g., benzodiazepines, Baum, C&E News, 1993, Jan. 18, page 33); 
oligocarbamates (Cho, et al., Science, 1993, 261, 1303); and/or peptidyl phosphonates 
(Campbell, et al., J. Org. Chem., 1994, 59, 658); nucleic acid libraries (see, Seliger, H., et al., 
Nucleosides & Nucleotides, 1997, 16, 703-710); peptide nucleic acid libraries (see, e.g., U.S. 
Pat. No. 5,539,083); antibody libraries (see, e.g., Vaughn, et al., Nature Biotechnology, 1996, 
14(3), 309-314 and PCT/US96/10287); carbohydrate libraries (see, e.g., Liang, et al., Science, 
1996, 274, 1520-1522 and U.S. Pat. No. 5,593,853, Nilsson, UJ, et al., Combinatorial 
Chemistry & High Throughput Screening, 1999, 2, 335-352; Schweizer, F; Hindsgaul, O. 
Current Opinion In Chemical Biology, 1999, 3, 291-298); isoprenoids (U.S. Pat. No. 
5,569,588); thiazolidinones and metathiazanones (U.S. Pat. No. 5,549,974); pyrrolidines (U.S. 
Pat. Nos. 5,525,735 and 5,519,134); morpholino compounds (U.S. Pat. No. 5,506,337); 
benzodiazepines (U.S. Pat. No. 5,288,514); and other similar art. 

Devices for the preparation of combinatorial libraries are commercially available (see, 
e.g., 357 MPS, 390 MPS, Advanced Chem. Tech, Louisville Ky., Symphony, Rainin, 
Woburn, Mass., 433A, Applied Biosystems, Foster City, Calif., 9050 Plus, Millipore, 
Bedford, Mass.). In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, RU, 
Tripos, Inc., St. Louis, Mo., ChemStar, Ltd., Moscow, RU, 3D Pharmaceuticals, Exton, 
Pa., Martek Biosciences, Columbia, Md., etc.). 

In the high throughput methods of the invention, several thousand different candidate 
compounds can be screened in a relatively short period of time. For example, each well 
of a microtiter plate can be used to run a separate assay against a selected potential 
modulator, or if concentration or incubation time effects are to be observed, every 5-10 
wells can test a single modulator. Thus, a single standard microtiter plate can assay about 
100 (96) modulators. If 1536 well plates are used, then a single plate can easily assay 
from about 100 to about 1500 different compounds. It is possible to assay many different 
plates per day; assay screens for up to about 6,000-20,000, and even up to about 
100,000-1,000,000 different candidate modulator compounds are possible using the methods of 
the invention. 
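
The plate arithmetic in the preceding paragraph can be worked through with a short calculation. This is a rough sketch under stated assumptions: the helper names `compounds_per_plate` and `plates_needed` are hypothetical, and control wells are ignored for simplicity even though the text recommends running positive controls.

```python
def compounds_per_plate(wells, wells_per_compound=1):
    """Compounds testable on one plate when each uses a fixed number of wells."""
    return wells // wells_per_compound


def plates_needed(n_compounds, wells, wells_per_compound=1):
    """Plates required to screen n_compounds (ceiling division)."""
    per_plate = compounds_per_plate(wells, wells_per_compound)
    return -(-n_compounds // per_plate)
```

For instance, with one well per compound a 96-well plate holds 96 compounds, so a 20,000-compound screen needs 209 plates; using 8 wells per compound for a concentration series drops the capacity to 12 compounds per plate, while a 1536-well plate covers 100,000 compounds in 66 plates.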

The following examples are presented to illustrate the present invention and to assist one of 
ordinary skill in making and using the same. The examples are not intended in any way to 
otherwise limit the scope of the invention. 

EXAMPLE 1 

The following provides a general description of how a list of candidate ceg sequences 
was generated. The list was generated by selecting candidate ceg gene sequences from a 
Concordance web engine using the method described in: Bruccoleri, R.E., Dougherty, 
T.J., Davison, D.B. (1998) "Concordance analysis of microbial genomes" in: Nucleic 
Acids Res 26:4482-4486. 

Microbial Genomics CEG Discovery Process Summary. 
Microbial Concordance Analysis 

The entire genomic sequence data of various bacteria was acquired from several public 
and proprietary sequence database sources, including GTC (Genome Therapeutics 
Corporation), and TIGR (The Institute for Genomic Research). 

Predicted ORFs from the genomic data were identified, translated, and stored, where 
desirable ORFs were at least 90 amino acid residues in length. Concordance analysis 
was performed among bacteria and various parameters were used to filter out genes with 
high similarity to eukaryotes. 



Concordance Analysis 

The entire genomic sequence of various Eubacteria was acquired from several public and 
private sources. The proprietary PathoGenome System from Genome Therapeutics 
Corporation, Waltham, MA, USA contributed data. Public data was obtained from 
GenBank (http://ncbi.nlm.nih.gov), The Institute for Genomic Research (TIGR), the 
Yeast Proteome Database, from Proteome, Inc. of Beverly, MA, and the Sanger Center of 
the Medical Research Council of the United Kingdom (http://www.sanger.ac.uk). 
Additionally, the non-microbial sequence data used as a basis for comparison and data 
subtraction was obtained from a proprietary database, including the LifeSeq Database 
from Incyte Pharmaceuticals, Palo Alto, CA. 

Where required, Incyte nucleotide sequences were translated into protein sequences in all 
six possible reading frames. GTC supplied predicted protein sequences with their data. In 
the case of other eubacterial nucleotide sequences, the program CRITICA (Badger, J. and 
Olsen, G., 1999 "CRITICA: coding region identification tool invoking comparative 
analysis" in: Molecular Biology and Evolution 16:512-524) was used to identify coding 
regions. The sequences were stored in flat files on a Unix computer system. Each predicted 
amino acid sequence had to be greater than 90 amino acids. 
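
Six-frame translation followed by a length filter can be illustrated with a few lines of code. The sketch below is a naive stand-in and is not the CRITICA algorithm: it translates both strands in all three frames with the standard genetic code, splits each translation at stop codons, and keeps stretches of at least `min_len` residues. The function names, the use of the standard code, the handling of ambiguous bases as "X", and the choice of 90 as an inclusive cutoff are all assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(dna):
    """Reverse complement of an upper-case DNA string (A, C, G, T only)."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna):
    """Translate from position 0; unknown codons become 'X', stops become '*'."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frame_orfs(dna, min_len=90):
    """Return stop-free peptide stretches of at least min_len residues."""
    dna = dna.upper()
    peptides = []
    for strand in (dna, reverse_complement(dna)):
        for frame in range(3):
            # Split the frame's translation at stop codons.
            for segment in translate(strand[frame:]).split("*"):
                if len(segment) >= min_len:
                    peptides.append(segment)
    return peptides
```

Note that this sketch does not require a start codon, so it reports open stretches between stops rather than true genes; a dedicated gene finder such as the one cited in the text applies further evidence before calling a coding region.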

Each predicted protein sequence was compared to every other sequence (an "all-against-all" 
comparison). The program used was FASTA (Pearson, W.R., "Flexible sequence 
similarity searching with the FASTA3 program package." Methods in Molecular Biology 
2000 132:185-219). The parameters used were ktup=2, and all scores above the default 
cutoff were kept. The output was processed and stored in a Postgres95 database 
(http://www.postgresql.org). Graphical user interfaces, using web browser technology, 
were constructed to query the database. 

A Concordance Analysis was performed on the data. The query used to generate the 
dataset was: show all Streptococcus pneumoniae open reading frames having greater than 
or equal to 30% overall protein sequence identity to selected gram-positive and/or 
gram-negative bacteria in the database. The data was further required not 
to match yeast or human sequences at greater than 30% overall protein sequence 
similarity. The resulting dataset included a list of more than 400 conserved amino acid 
sequences having known or unknown function. The amino acid sequences having 
unknown functions formed the basis of a list designated Conserved Unknown Reading 
Frames, or CURFs, which is a subset of the total list of CEGs (e.g., the list of CEGs 
includes sequences of known and unknown function). 
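
The concordance query described above can be sketched as a simple filter. This is an illustration only and not the Concordance system itself. The data layout (a dictionary of best percent identities per organism), the function name, and the reading of "and/or" as "at least one selected bacterium" are assumptions.

```python
def conserved_orfs(best_identity, selected_bacteria,
                   excluded=("yeast", "human"), threshold=30.0):
    """Return ORF ids that pass a concordance-style filter.

    best_identity maps ORF id -> {organism name: best percent identity}.
    This layout is hypothetical and chosen only for illustration.
    """
    kept = []
    for orf_id, by_organism in best_identity.items():
        # Conserved: >= threshold identity to at least one selected bacterium.
        in_bacteria = any(by_organism.get(org, 0.0) >= threshold
                          for org in selected_bacteria)
        # Subtracted: any yeast or human match above the threshold.
        in_excluded = any(by_organism.get(org, 0.0) > threshold
                          for org in excluded)
        if in_bacteria and not in_excluded:
            kept.append(orf_id)
    return kept


# Made-up toy values, not data from this application.
toy = {
    "orf_a": {"B. subtilis": 45.0, "E. coli": 38.0, "yeast": 12.0},
    "orf_b": {"B. subtilis": 52.0, "human": 41.0},
    "orf_c": {"E. coli": 22.0},
}
print(conserved_orfs(toy, ["B. subtilis", "E. coli"]))  # ['orf_a']
```
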

The resulting list of conserved genes (e.g., more than 400 sequences) was used as a basis 
for selecting and screening bacterial gene sequences that are essential for cell viability. 
The Concordance system was designed to permit high-throughput identification of 
conserved gene sequences in the database (Bruccoleri, R., Dougherty, T., and Davison, D. 
1998 "Concordance analysis of microbial genomes" Nucleic Acids Res. 26:4482-4486). 

Data Curation And Analysis 

Exact N-terminal and C-terminal translational start sites of genes were identified by 
pairwise similarity searches and multiple sequence alignments. Ribosome binding sites, 
terminators, nearby genes, and operons were also identified. 

The resulting list of conserved genes was used as a basis for selecting and screening 
bacterial gene sequences that are essential for cell viability. This Concordance system 
was designed to permit high throughput use of the conserved gene sequences contained 
on the list. A set of Knockout PCR primers was generated, based on the list of 
conserved genes, for use in the gene disruption procedure described 
below. The PCR primers were designed to amplify a central 300-500 bp region of the 
ceg (to prevent generation of a functional copy of the ceg gene following integration). 
The primers were ordered electronically, placed in a 96-well format, and used in the gene 
disruption procedure as described below. 
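
The choice of a central 300-500 bp target region can be sketched as follows. This is a hypothetical helper, not the primer design software used in this Example. The default fragment length of 400 bp and the minimum flank of 50 bp on each side are assumptions.

```python
def central_fragment(gene_length, fragment_length=400, flank=50):
    """Return 0-based, half-open (start, end) of a centered internal fragment.

    fragment_length must lie in the 300-500 bp window stated in the text.
    flank is an assumed minimum number of bases left at each end so that
    the fragment is truncated at both the 5' and 3' ends of the gene.
    """
    if not 300 <= fragment_length <= 500:
        raise ValueError("fragment length must be 300-500 bp")
    if gene_length < fragment_length + 2 * flank:
        raise ValueError("gene too short for an internal fragment")
    start = (gene_length - fragment_length) // 2
    return start, start + fragment_length


print(central_fragment(1200))  # (400, 800)
```
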

EXAMPLE 2 

The following provides a description of the procedure to generate recombinant vectors of 
pEVP-3 having inserts of candidate ceg nucleotide sequences. The Knockout primers 
generated by the method described in Example 1 above were used to generate DNA 
fragments comprising candidate ceg sequences. 

Genomic PCR Knockout Target Fragment Generation 

Reactions were set up in 96-well plate format (36 µl H2O, 5 µl 10x Vent™ buffer, 1 µl 
gene-specific knockout forward primer (0.5 µg/µl), 1 µl gene-specific knockout reverse 
primer (0.5 µg/µl), 0.5 µl Vent™ DNA polymerase (2000 U/ml; New England Biolabs, Beverly, 
MA), 1.5 µl each dNTP (10 mM; 6.0 µl total), 0.5 µl S. pneumoniae chromosomal DNA 
(0.5 µg/µl); 50 µl total volume per reaction). 
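
The per-reaction volumes above sum to 50 µl, and the components shared by every well can be scaled into a master mix. The following is a minimal sketch. The master-mix approach, the overage fraction, and all names in the code are assumptions and are not part of the described procedure.

```python
# Per-reaction volumes in microliters, taken from the reaction described above.
REACTION_UL = {
    "H2O": 36.0,
    "10x Vent buffer": 5.0,
    "forward primer": 1.0,
    "reverse primer": 1.0,
    "Vent DNA polymerase": 0.5,
    "dNTPs (4 x 1.5)": 6.0,
    "chromosomal DNA": 0.5,
}

# Primers differ per well, so they are left out of the shared mix.
GENE_SPECIFIC = {"forward primer", "reverse primer"}


def master_mix(n_reactions, overage=0.10):
    """Scale shared components; overage is an assumed pipetting allowance."""
    scale = n_reactions * (1 + overage)
    return {name: round(vol * scale, 1)
            for name, vol in REACTION_UL.items()
            if name not in GENE_SPECIFIC}


print(sum(REACTION_UL.values()))  # 50.0
```
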

The nucleotide sequences of the forward and reverse knockout primer pairs were 
generated from the nucleotide sequence information obtained from the Genome 
Therapeutics Corporation database for Streptococcus pneumoniae. The primer pairs were 
each used in a PCR reaction to generate a unique internal (e.g., central region) fragment 
of the candidate gene targeted for knockout. 

The PCR program was set in the PCR machine (initial: 95 °C for 5 minutes; 30 cycles of: 
95 °C for 1 minute, 58 °C for 1 minute, 72 °C for 30 seconds; final: 72 °C for 10 minutes; 
4 °C hold indefinitely). The fragments were purified over a PCR purification kit (Qiagen), 
and 5 µl of each reaction was run on a 0.8% agarose gel to visualize the fragments. 
Ligation reactions were then performed. 
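
The cycling program described above can be written down as data, which makes it easy to check the total programmed hold time. This is a sketch only. The variable names are assumptions, and ramp times between temperatures are not given in the text and are ignored.

```python
# Each step is (temperature in degrees C, hold time in minutes), as stated above.
INITIAL = (95, 5.0)
CYCLE = [(95, 1.0), (58, 1.0), (72, 0.5)]
N_CYCLES = 30
FINAL = (72, 10.0)


def programmed_minutes(initial=INITIAL, cycle=CYCLE,
                       n_cycles=N_CYCLES, final=FINAL):
    """Sum of hold times only; ramp times and the 4 degree hold are ignored."""
    per_cycle = sum(minutes for _, minutes in cycle)
    return initial[1] + n_cycles * per_cycle + final[1]


print(programmed_minutes())  # 90.0
```
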

Ligation reactions were set up in 96-well plate format (10.0 µl genomic PCR 
fragment (generated from step 2 above), 1.0 µl pEVP-3 SmaI-cut vector (1:10 dilution of 
vector DNA at 50-100 ng/µl), 1.5 µl 10x ligation buffer (New England Biolabs™), 1.0 µl 
T4 DNA Ligase (New England Biolabs™; 400,000 U/ml), 1.5 µl ddH2O; 15.0 µl total 
reaction volume). 

Reactions were incubated in the 96-well plate at 14 °C overnight in the PCR 
machine. Transformations into E. coli, for in vivo amplification, proceeded the 
following day. 

The nucleotide sequences of the forward and reverse primer pairs used for the polarity 
test were generated in a similar manner, from the nucleotide sequence information 
obtained from the Genome Therapeutics Corporation database for Streptococcus 
pneumoniae. The primer pairs were each used in a PCR reaction to generate a unique 
fragment of the candidate gene targeted for the polarity test. The fragment generated for 
the polarity test included the entire ceg coding sequence region but lacked the expression 
regulatory sequences. 

Transformation into E. coli (strain LE392): 

The next day, 3 µl of the above ligation mix was used per transformation reaction, plus 
50 µl LE392 competent cells. Reactions were set up in 96-well plate format; incubated on 
ice for 30 minutes; heat-shocked at 42 °C for 90 seconds; and incubated on ice for 2 minutes. 
Then 100 µl SOC media (Gibco BRL) was added, and the cells were incubated at 37 °C on a 
platform shaker for 1 hour and plated on LB/chloramphenicol (13.0 µg/ml) agar plates. The 
plates were incubated overnight at 37 °C, inverted, and colony PCR was then performed to 
confirm the constructs. The universal primers flanking the insert site in pEVP-3 were used 
for PCR amplification. 

The colony PCR involved the following. Reactions were set up in 96-well plate format 
(36.5 µl H2O, 0.5 µl pEVP-3 forward primer (0.25 µg/µl), 0.5 µl pEVP-3 reverse primer 
(0.25 µg/µl), 1.5 µl each (6.0 µl total) dNTPs (10 mM), 0.5 µl Vent™ DNA polymerase, 
5 µl 10x Vent™ buffer, 1 µl of a 1:50 cell dilution; 50 µl total volume). 

pEVP-3 forward primer: 5' CATCAAGCTTATCGATACCGTCG 3' (SEQ ID NO:437) 
pEVP-3 reverse primer: 5' CACAGTAGTTCACCACCTTTTCCC 3' (SEQ ID NO:438) 

Colonies of E. coli LE392 were picked onto a master plate of LB + 13 µg/ml 
chloramphenicol (incubated throughout the day at 37 °C) and then into 50 µl H2O that 
had been placed into a 96-well plate. 1 µl of this dilution was used in the above PCR reaction 
(if the 96-well dilution plate is kept, a master plate need not be prepared). Cultures 
for minipreps of plasmid candidates may be prepared directly from the cell dilutions. 

The PCR program was run (95 °C - 5 minutes, 30 Cycles of: 95 °C - 1 minute, 58 °C - 1 
minute, 72 °C - 30 seconds, 72 °C - 10 minutes, 4 °C - hold). 

A 10 µl sample of each reaction was run on a 1.0% TBE gel. A gel designed for 96-well 
plates and a multichannel pipettor were used to ease loading of the sample rows. The gel 
was run and stained with ethidium bromide. Positive clones were identified by the presence 
of insert(s) of the appropriate molecular size, amplified by the flanking pEVP-3 primers. 

Minipreps Of Plasmids To Identify Cells Carrying The pEVP-3 Vector With An Insert 

The constructs that carried an insert were identified. The constructs having an insert 
were inoculated into a 5 ml LB/Cm culture and incubated overnight at 37 °C with 
aeration. Miniprep plasmid DNA was prepared by a standard procedure. The miniprep 
DNA was digested with appropriate restriction enzymes to confirm the presence of the 
insert (the enzymes flank the SmaI site in pEVP-3) (10 µl miniprep DNA, 2 µl 10x buffer, 
1 µl XbaI, 1 µl XhoI, 6 µl ddH2O; 20 µl total volume for digest). 

To confirm the presence of an insert, the digest reactions were electrophoresed on an 
agarose gel and the gel was stained with ethidium bromide. The positive clones were 
used for the S. pneumoniae KNOCKOUT procedure. 

The confirmatory PCR reactions, using knockout-specific primers (quality control step), 
involved 35.5 µl H2O, 5 µl 10x Vent™ buffer, 1 µl knockout forward primer (0.5 µg/µl), 
1 µl knockout reverse primer (0.5 µg/µl), 0.5 µl Vent™ DNA polymerase 
(2000 U/ml), 1.5 µl each dNTP (10 mM; 6.0 µl total), 1.0 µl miniprep DNA from the test 
clone; 50 µl total reaction volume. The PCR program was as follows: 95 °C for 5 
minutes; 30 cycles of: 95 °C for 1 minute, 60 °C for 1 minute, 72 °C for 30 seconds; 72 
°C for 10 minutes; hold at 4 °C. The presence of the correct-sized insert was confirmed 
by agarose gel electrophoresis and ethidium bromide staining. The confirmed clones 
were used for the S. pneumoniae gene KNOCKOUT procedure. Glycerol stocks were 
made of all positive E. coli LE392 constructs and frozen at -80 degrees C. 

EXAMPLE 3 

The following provides a description of the high throughput gene disruption procedure 
used in S. pneumoniae (e.g., gene knockout procedure). The candidate ceg 
fragments that were generated by the method described in Example 2 were used in the 
gene disruption procedure in order to identify ceg nucleotide sequences that are required 
for cell viability. 

Reactions were set up in 1.5 ml Eppendorf tubes or a 96-well plate (1 µg total of miniprep 
pEVP-3 + insert DNA (usually 10 µl of Qiagen miniprep DNA); then 200 µl of S. 
pneumoniae (strain Rx-1) competent cells diluted 1:10 in competence media was added 
(1 ml of competence media = 980 µl Todd Hewitt (Difco Laboratories) with 0.5% yeast 
extract, 20 µl 10% BSA, 1 µl 10% CaCl2, and 0.5 µl (200 µg/ml) Csp-1 competence 
peptide)). 

Controls were run with each KNOCKOUT experiment and involved 1 µg pEVP-3 LytA 
construct = positive control (non-essential), or 1 µg pEVP-3 FtsZ construct = negative 
control (essential). Then the 96-well plates and controls were incubated at 37 °C for 2.5 
to 3 hours in a 37 °C room without shaking. The 200 µl samples were plated on 
Todd Hewitt agar plates with 0.5% yeast extract and 2 µg/ml chloramphenicol. 

The samples were incubated overnight at 37 °C in a 5% CO2 incubator. Control plates were 
checked for the presence of colonies (pEVP-3::lytA) and for no growth (pEVP-3::ftsZ). Plates 
were examined for growth: ca. 70-150 colonies designated nonessential genes, and zero 
colonies designated essential genes. 
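
The plate-reading rule described above can be sketched as a small classifier. It is illustrative only. The function name and the "invalid run" and "ambiguous" categories are assumptions added for counts or controls that fall outside the ranges stated in the text.

```python
def call_essentiality(colonies, lyta_control_colonies, ftsz_control_colonies):
    """Classify a knockout plate from its colony count.

    A run is treated as valid only when the lytA control (non-essential)
    shows colonies and the ftsZ control (essential) shows no growth.
    """
    if lyta_control_colonies == 0 or ftsz_control_colonies > 0:
        return "invalid run"       # assumed handling, not from the text
    if colonies == 0:
        return "essential"
    if 70 <= colonies <= 150:
        return "nonessential"
    return "ambiguous"             # assumed category for other counts


print(call_essentiality(0, 120, 0))    # essential
print(call_essentiality(95, 120, 0))   # nonessential
```
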

The polarity test was performed in a similar manner, using the polarity fragments 
described in Example 2. 

EXAMPLE 4 

The following provides a description of the autolysin procedure used to determine that 
the non-essential control samples of S. pneumoniae contain a disrupted lytA gene. 

Phenotypic Autolysin Test 

The culture plates containing transformants carrying the lytA control vector were flooded 
with 0.1% deoxycholate in H2O. The plates were observed after 5-10 minutes. Plates 
with "ghosts" indicated an intact lytA gene, whereas plates without "ghosts" indicated a 
disrupted lytA gene. The "ghost" phenomenon is due to detergent-triggered autolysis of 
the cells, causing a gradual fading of the colonies. 

The detergent treatment triggers the autolysin in lytA-intact cells; it cannot trigger the 
autolysin (lytA gene product) in lytA-disrupted cells. Colonies with intact lytA "ghost" in 
5-10 minutes due to massive pneumococcal cell lysis. 

EXAMPLE 5 

The following provides a description of the procedure used to express the CEG proteins 
(e.g., designated CFE proteins) in E. coli cells. 

CEG Protein Production 

Full-length ceg genes were inserted into the pET-21 expression vector, for use in the 
E. coli BL21 λDE3 expression system, using the following method: 

For each ceg, custom primers were used to insert the N- and C-termini into vectors such that 
the 5' end (N-terminus of the CEG) is positioned properly for expression behind the T7 
promoter and optimally placed with regard to the pET ribosome binding site. The pET 
vectors contain an NdeI site which allows positioning of the ATG start site in the vector. In 
cases where the ceg sequence contains an internal NdeI site, blunt ligation of the ceg PCR 
fragment into the vector is accomplished via Klenow fill-in of the NdeI site. In many 
cases, primers were also designed such that the ceg 3' end (C-terminus of the expressed 
protein) will contain an in-frame extension of 6X-histidine residues, encoded in the 
vector sequence of pET-21. The individual cegs were PCR amplified via custom 
designed primers as described above. Both the ceg PCR product and the vector DNA were 
digested with appropriate restriction enzymes. The full-length cegs were ligated into the 
pET expression vector. The ligation mixture was transformed into competent E. coli BL21 
λDE3 cells, and transformants were selected on LB agar with 50 µg/ml ampicillin. Positive 
insert-bearing clones were screened via minipreps of the plasmids and size analysis on 
0.8% agarose gels, with detection by ethidium bromide staining, as above. 

Protein Production 

The proper reading frame of each ceg inserted into pET-21 is verified by DNA 
sequencing. 

A small (2-5 ml) test culture of E. coli BL21 λDE3 with the insert-bearing plasmid is 
tested for protein expression by IPTG induction of the expression vector for 1-2 hours. 
The expression is verified by SDS-Polyacrylamide Gel Electrophoresis analysis of a 
whole cell extract (SDS extract of 0.5-1 ml of cells treated at 100 °C for 5 minutes) to 
determine whether the protein is over-expressed and migrates at the correct predicted 
molecular weight. 

The protein is overproduced and purified via the following method. A large scale 
(500-1000 ml) culture of E. coli is grown to early logarithmic phase in broth (e.g., LB broth) 
and protein expression is induced for 2 hours with IPTG (isopropyl-β-D-thiogalactoside). 
The cells are harvested by centrifugation (8000 x g; 15 minutes) and the cell pellets are 
resuspended in 20 ml of buffer. The cells are lysed by sonication, and the lysate is 
centrifuged at low speed (5000 x g, 15 min.) to remove unbroken cells. The 
supernatant fluid, containing the over-expressed protein, is subjected to Ni-NTA affinity 
column chromatography (Qiagen, Inc., Chatsworth, CA). The 6X-histidine residues 
linked at the C-terminal end of the CEG proteins permit rapid protein purification via 
selective binding to a Ni-NTA resin column. The protein-bound Ni-NTA resin is washed to 
remove contaminants, and the bound proteins are subsequently eluted with imidazole and 
recovered. It is possible to upscale this procedure to larger volumes for higher yields of 
proteins. 

EXAMPLE 6 

The following provides a description of the methods used to purify all 2CEG 
polypeptides (e.g., 2CFE polypeptides #19-117; SEQ ID NOS:349-436) having a 
histidine tag at their C-terminal ends. The 2CEG polypeptides having the his-tags were 
produced by the methods described in Example 5, supra. As an example, results of 
purification of 2CFE 75 polypeptide are presented. 

Production Of The CFE Polypeptides 

The BL21 λDE3 cells harboring recombinant pET-21 vectors carrying a 2CFE nucleotide 
sequence (SEQ ID NOS:244-331) were cultured in LB broth containing ampicillin. 
When the A600 reached approximately 0.6, protein production was induced by adding 1.0 
mM IPTG, and the cells were cultured for an additional 2 hours. The cell pellet was 
collected by centrifugation, and the collected cell pellet was sonicated in Solution A (50 
mM NaPO4, 300 mM NaCl, pH 8.0). The sonicated cells were centrifuged at 10,000 
RPM to remove the debris. 

Purification Of The CFE Polypeptide 

The supernatant was diluted with Solution A and loaded onto a Ni-NTA column (Qiagen) 
equilibrated with Solution A; the column bed size was 2.5 x 25 cm, and the flow rate was 
approximately 3.0 ml/minute. The 2CFE protein was eluted using a linear gradient of 
imidazole (0-250 mM in 450 ml) at a flow rate of approximately 3.0 ml/minute. The 
eluted samples were collected as 22 ml fractions per tube and were 
monitored using spectrophotometry. The amount of protein in the eluted fractions was 
estimated using the Bradford method (Bradford, M. M., 1976 Anal. Biochem. 72:248) and 
the samples were run on an SDS-PAGE gel (Novex EC6008) (Figure 3 A). Fractions 
were selected for pooling based on the results of the SDS-PAGE gel. The pooled 
fractions were concentrated using a 10,000 MW Centricon (Amicon) to approximately 5 
ml. 
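
For the linear imidazole gradient described above (0-250 mM over 450 ml, collected in 22 ml fractions), the nominal imidazole concentration in any fraction can be estimated as sketched below. The function names are hypothetical, and the calculation assumes an ideal linear gradient with no column dead volume or mixing delay.

```python
import math

GRADIENT_START_MM = 0.0     # mM imidazole at the start of the gradient
GRADIENT_END_MM = 250.0     # mM imidazole at the end of the gradient
GRADIENT_VOLUME_ML = 450.0  # total gradient volume
FRACTION_ML = 22.0          # volume collected per tube


def imidazole_mM(volume_ml):
    """Nominal concentration after volume_ml of gradient has been delivered."""
    done = min(max(volume_ml / GRADIENT_VOLUME_ML, 0.0), 1.0)
    return GRADIENT_START_MM + done * (GRADIENT_END_MM - GRADIENT_START_MM)


def fraction_midpoint_mM(fraction_number):
    """Nominal concentration at the midpoint of a 1-based fraction number."""
    return imidazole_mM((fraction_number - 0.5) * FRACTION_ML)


# Number of tubes needed to collect the whole gradient.
n_fractions = math.ceil(GRADIENT_VOLUME_ML / FRACTION_ML)
print(n_fractions)          # 21
print(imidazole_mM(225.0))  # 125.0
```
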

For the 2CFE 75 polypeptide, a precipitate formed and was redissolved upon increasing the 
sample volume and removing the imidazole by repeated concentration in 50 mM Tris, 
100 mM NaCl, pH 7.5. Varying amounts of the 2CFE 75 polypeptide were diluted in 
either 20 mM Tris, 20 mM KCl, pH 7.5 or 20 mM Tris, 20 mM MgCl2, pH 7.5 at 
concentrations of 12, 24, or 36 µg/ml. The diluted samples were electrophoresed on an 
SDS-PAGE gel under non-reducing conditions (Figure 3 B). The results in Figure 3 B 
suggest that 2CFE 75 forms a multimer. 

EXAMPLE 7 

The following provides a description of the methods used to purify CEG polypeptides 
that lack a histidine tag (e.g., 2CFE polypeptides #1-17; SEQ ID NOS:332-348). As an 
example, the results of purification of CFE 3 polypeptide are presented. 

Purification of the CFE 3 Polypeptide 

The 2CFE 3 polypeptide was produced using the large scale IPTG-induced method 
described in Example 5, supra. The 2CFE 3 (SEQ ID NO:334) polypeptide lacks a 
C-terminal histidine tag. The 2CFE 3 polypeptide was purified using a 2-column 
procedure. The 2CFE 3 polypeptide preparation was eluted from a 26/10 Q Sepharose 
column (Pharmacia) using a 0-1.0 M NaCl gradient; the flow rate was 2 ml/minute, and the 
gradient size was 1 liter. Then the 2CFE 3 polypeptide was eluted from a hydroxyapatite 
Bio-gel column (Bio-Rad) using a 5-200 mM potassium phosphate (pH 8.0) gradient; the 
flow rate was 0.3 ml/minute, and the gradient size was 300 ml. A sample of the 2CFE 3 
preparation was run on a polyacrylamide gel (Figure 4). 

EXAMPLE 8 

The following provides a description of the size exclusion chromatography methods used 
to estimate the molecular weight and determine whether the CEG polypeptides 
oligomerize. The CFE polypeptides may exist as monomers or may oligomerize to form 
dimers, tetramers, hexameric rings, or other oligomeric forms. 

Size exclusion chromatography was performed on all isolated 2CFE polypeptides #s 
1-117 (e.g., SEQ ID NOS:332-436). This method was performed using various types of 
columns, depending on the particular 2CFE polypeptide tested. 

The Biosil SEC-125 HPLC Gel Filtration column (BioRad Laboratories, Inc) was used, 
for example, to characterize CFE 8. The mobile phase was 0.2 M KH2PO4, 0.9% NaCl, 
pH 6.8. 

The Phenomenex 600 x 7.5 mm Biosep SEC-S 3000 column was used, for example, to 
characterize 2CFE 21 and 39. The mobile phase for size exclusion was 50 mM 
Na2HPO4, pH 7.0 and 150 mM NaCl, run at 1 ml/minute in a Gilson HPLC system, with 
protein detection at 280 nm. 

EXAMPLE 9 

The following provides a description of the computer-aided methods used to search for 
similarities between the amino acid sequences of the CEG polypeptides and sequences 
available through public and proprietary databases. In many cases, the function of the 
CEG polypeptides was suggested by the results of the similarity searches. The function 
of some of these CEG polypeptides has been confirmed by performing additional 
analyses. Table V provides a list of the suggested and confirmed functions of CEG 
polypeptides designated CFEs #1-117. 

The putative functions of the CFE polypeptides were determined using computer-aided 
bioinformatic approaches, including distant homologies, motif searching, or predictions 
based on statistical rules. For example, the distant homology approach involved pairwise or 
multiple sequence alignments, employing tools such as FASTA and Psi-BLAST. The 
motif searching approach involved using sophisticated hidden Markov models. The 
approach based upon predictions of statistical rules involved prediction of transmembrane 
regions, coiled-coil, and other structural motifs. These approaches have been reviewed in 
Computational Methods In Molecular Biology 1998, eds. Salzberg, S.L., Searls, D.B., 
and Kasif, S., Elsevier, and in Bioinformatics: A Practical Guide To The Analysis Of Genes 
And Proteins 1998, eds. Baxevanis, A.D. and Ouellette, B.F.F., Wiley-Interscience. 

Global sequence similarity searches were performed using the amino acid sequences of 
all the conserved essential gene sequences (e.g., CFEs 1-117; SEQ ID NOS:114-226) to 
search against a non-redundant protein database using the BLAST2 algorithm (Altschul 
S.F., et al., 1997 Nucleic Acids Res. 25(17):3389-3402). In a similar search, similar 
sequences were identified in the Concordance database using the "Neighbor" function 
(Bruccoleri R. E., Dougherty T.J., Davison D.B. 1998 Nucleic Acids Res. 
26(19):4482-4486). To determine if the predicted amino acid sequences were full length 
and in the proper reading frame, BLAST-type searching and CLUSTAL multiple sequence 
alignments (Higgins D.G., et al., 1996 Methods Enzymol. 266:383-402) were used. 

Local sequence similarity searches were performed by searching for Prosite (Hofmann 
K., et al., 1999 Nucleic Acids Res. 27(1):215-219) and Pfam motifs (Bateman A., et al., 
2000 Nucleic Acids Res. 28(1):263-266). Additionally, the amino acid sequences of the 
CFEs were analyzed by performing protein threading analyses using the ProCeryon fold 
recognition program (Sippl, et al., 1992 Proteins 13:258-271; Sippl, J. 1993 J. Comp. 
Aided Mol. Design 7:473-501; www.proceryon.com) and Geneformatics. 

In bacteria, many operons include genes encoding different proteins that catalyze discrete 
steps of a common biochemical pathway. Therefore, the operon structures in S. 
pneumoniae were compared with those in other bacteria in order to predict the function of 
CFE polypeptides. 

Additionally, analysis of bacterial metabolic pathways was performed using Pathway 
Tools from DoubleTwist, based on the EcoCyc system (Karp P.D., et al., 1999 Nucleic 
Acids Res. 27(1):55-58). This analysis was used to predict which CFEs mediate 
various steps of the pathways. 

When the sequence identity between a CFE polypeptide and the annotated database (e.g., 
SwissProt, Genbank) was low (e.g., sequence identity less than about 30%), a Protein 
Threading (e.g., fold recognition) method was used to predict similarities in the folded 
protein structure of CFE polypeptides in the absence of a high level of sequence similarity 
with proteins in the databases (review by Teichmann, et al., 1999 Current Opinion in 
Structural Biology 9:390-399). The Protein Threading method predicts the compatibility of 
a query sequence (e.g., CFE polypeptide sequences) with each of the folds in a library of 
known protein structures. The library of known protein structures was developed, maintained, 
and updated throughout the search process. 

A list of potential structural folds with which each query was compatible was generated for 
all CFE polypeptides (e.g., SEQ ID NOS:114-226). The fold assignments for each query 
were used to generate pairwise sequence alignments. The pairwise sequence alignments 
were used to generate protein models of the query polypeptide (e.g., CFE polypeptides). 

The pairwise sequence alignments were also used to compare the position of critical 
residues of the structural template with the query polypeptide. The list of critical residues 
was generated by using multiple sequence alignments derived from a structural 
classification of proteins to generate a conservation profile which provided sequence- 
specific positions conserved across a homologous family of protein folds. Comparative 
modeling was used to search the model of the query polypeptide for the critical residues and 
determine whether the structural and functional motifs are conserved in the query protein. 
Conservation of structural and functional motifs permitted assignment of putative structure 
and function to a query polypeptide sequence. 

The Protein Threading method was used to search for putative folded structure and function 
for all CFE polypeptides (SEQ ID NOS:114-226). The CFE polypeptides having significant 
sequence identity (e.g., more than 30%) to known proteins were assigned putative functions 
with a high level of confidence. 

EXAMPLE 10 

The following provides a description of the methods used to characterize purified CFE 
101 polypeptide. The 2CFE 101 polypeptide mediates the conversion of pantothenate to 
4'-phosphopantothenate, and is predicted to be a pantothenate kinase. 

Computer-Aided Comparison 

The computer-aided comparison, as described in Example 9 supra, suggests that the 
amino acid sequence of the CFE 101 polypeptide (SEQ ID NO:210) is 42% similar to the 
amino acid sequence of the coaA protein of E. coli. Thus, CFE 101 may be a 
pantothenate kinase, which mediates the conversion of pantothenate to 
4'-phosphopantothenate (Figure 5). 

Circular Dichroism and Circular Dichroism Thermal Melt Analysis 

Circular dichroism and circular dichroism melt methods were used to determine the 
folded structure of the expressed and isolated 2CFE polypeptides. For example, this 
method was used to characterize the folded structure of isolated 2CFE 101 (SEQ ID 
NO:421). 

The starting concentration of the 2CFE 101 polypeptide was such that the OD205 was 
approximately 1.5, and the OD280 was approximately 0.05 (e.g., 0.05 to 0.1 mg/ml). The 
starting concentration of 2CFE 101 was approximately 344 µM in 50% glycerol, 50 mM 
Tris, 100 mM NaCl, 5 mM MgCl2, 0.5 mM EDTA, at pH 7.5. The polypeptide was 
diluted to a final concentration of 7 µM, as determined by absorbance at 280 nm, in 20 mM 
Na-phosphate, 100 mM KCl, at pH 7.0. The circular dichroism analysis was performed 
using quartz cuvettes, the instrumentation was from JASCO (Model J-720), and the readings 
were performed at 25 degrees C (Figure 6 A). The band width was 1 nm, the sensitivity 
was 20 mdeg, the response was 0.25 seconds, the scan speed was 50 nm/minute, and the 
step was 0.5. The circular dichroism thermal melt analysis was performed over a range of 
between 0 and 100 degrees C (Figure 6 B). Additionally, the circular dichroism was 
performed comparing monomer and aggregate pools of 2CFE 101. 

Size Exclusion Analyses 

Size exclusion chromatography methods were performed using the Biosil SEC column, 
as described in Example 8 supra. The results suggest that the 2CFE 101 polypeptide 
forms a monomer (40,200 Da) and oligomers (194,000 Da). The specific activities of the 
monomer and oligomeric forms of 2CFE 101 were determined, as described below. 

Biochemical Assays 

The biochemical assays of the 2CFE 101 polypeptide were based on the PK/LDH coupled 
enzyme assays described by Vallari, D. S., et al. (1987 J. Biol. Chem. 262:2468-2471) 
and Song, W.-J., et al. (1994 J. Biol. Chem. 269:27051-27058). 

Briefly, the assay was performed as follows. The reaction included: 885 µl of 0.1 M 
Tris-HCl (pH 7.6), 25 µl NADH (14.1 mM), 20 µl ATP (10.7 mM), 50 µl 
phospho-enol-pyruvate (56 mM), 5 µl LDH/PK (lactate dehydrogenase/PK; Sigma, catalog 
# P-0294, 60 U/ml PK, 1050 U/ml LDH), 5 µl of the 2CFE 101 polypeptide (9 mg/ml in 50 mM 
Tris-HCl, pH 7.5, 100 mM NaCl, which was diluted to 4.5 mg/ml in 50% glycerol). The 
reaction was started by adding 10 µl pantothenate (100 mM; Sigma, catalog # P2250). 
The production of ADP in the reaction was monitored by measuring the absorbance at 340 
nm. The results in Figure 8 show that the 2CFE 101 polypeptide mediates ADP 
production in the presence of pantothenate and ATP. The Km of pantothenate (n=4) was 
144 (±16.5) µM, and the Vmax of the 2CFE 101 polypeptide (n=4) was 2.04 (±0.25) µM min⁻¹ 
mg⁻¹. The monomer form has a specific activity of approximately 1.7 µM min⁻¹ mg⁻¹. 
The oligomeric form has a specific activity of 0.26 µM min⁻¹ mg⁻¹. 
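
The component volumes listed above sum to 1000 µl, so the final concentration of each substrate follows directly from its stock concentration. The sketch below computes these final concentrations and evaluates the Michaelis-Menten rate law with the reported Km and Vmax. The function names are hypothetical, and the use of simple Michaelis-Menten kinetics for this enzyme is an assumption made for illustration.

```python
# (volume in microliters, stock concentration in mM), from the assay above.
COMPONENTS = {
    "NADH": (25.0, 14.1),
    "ATP": (20.0, 10.7),
    "phosphoenolpyruvate": (50.0, 56.0),
    "pantothenate": (10.0, 100.0),
}

# Tris-HCl buffer, LDH/PK mix, and 2CFE 101 polypeptide volumes (microliters).
OTHER_VOLUMES_UL = 885.0 + 5.0 + 5.0

TOTAL_UL = OTHER_VOLUMES_UL + sum(vol for vol, _ in COMPONENTS.values())


def final_mM(name):
    """Final concentration of a component in the assembled reaction."""
    vol, stock = COMPONENTS[name]
    return vol * stock / TOTAL_UL


def michaelis_menten(s_uM, km_uM=144.0, vmax=2.04):
    """Rate at substrate concentration s_uM, in the same units as vmax."""
    return vmax * s_uM / (km_uM + s_uM)


print(TOTAL_UL)                  # 1000.0
print(final_mM("pantothenate"))  # 1.0
```

At a substrate concentration equal to the Km, the predicted rate is half of Vmax, which is a quick sanity check on the fitted parameters.
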

Alternatively, the 2CFE 101 polypeptide can be tested in an assay that monitors the 
conversion of pantothenate to 4'-phosphopantothenate. The same reaction described 
above can be used, except that 14C-labeled pantothenate is used. The reaction can be 
monitored by measuring the amount of 14C-labeled 4'-phosphopantothenate produced. 

EXAMPLE 11 

The following provides a description of the methods used to characterize purified CFE 
39 and CFE 21 polypeptides, carrying a C-terminal histidine 6-tag. The methods include 
helicase reactions, in which synthetic Holliday Junction templates are resolved into 
duplex structures. In one method, the helicase reaction was monitored using radiolabeled 
templates. In another method, the helicase assay was adapted for use in a high 
throughput assay employing fluorescence labeled templates. 

Computer-Aided Comparison 

The computer-aided comparison, as described in Example 9 supra, suggests that the CFE 
39 polypeptide (SEQ ID NO:148) is an RuvA homologue. The comparison also 
suggests that CFE 21 (SEQ ID NO:132) is an RuvB homologue. 

Previous studies by Parsons and others have shown that RuvA and RuvB proteins, in E. 
coli, promote branch migration or movement of Holliday Junctions during genetic 
recombination and DNA repair (Parsons, C. A., et al., 1992 Proc. Natl. Acad. Sci. USA 
89:5452-5456; Tsaneva, I. R., et al., 1993 Proc. Natl. Acad. Sci. USA 90:1315-1319; 
Muller, B., et al., 1993 J. Biol. Chem. 268:17179-17184; Mitchell, A. H. and S. C. West 
1996 J. Biol. Chem. 271:19497-19502; Parsons, C. A. and S. C. West 1993 J. Molec. 
Biol. 232:397-405; Tsaneva, I. R., et al., 1992 Molec. Gen. Genet. 235:1-10; Mitchell, A. 
H. and S. C. West 1994 J. Molec. Biol. 243:208-215). 

Size Exclusion Chromatography 

Size exclusion chromatography was performed on 2CFE 39 (SEQ ID NO:366) and 2CFE 
21 (SEQ ID NO:350) using the Phenomenex 600 x 7.5 mm Biosep SEC-S 3000 column, 
as described in Example 8 supra. Protein standards (BioRad) were used to calibrate the 
column, including thyroglobulin (670,000 Da), gamma globulin (158,000 Da), ovalbumin 
(44,000 Da), myoglobin (17,000 Da), and B-12 (1350 Da). 

The results indicate that 2CFE 39 (RuvA) forms tetramers and 2CFE 21 (RuvB) forms a 
hexameric ring structure. Selected eluted samples were electrophoresed on a 
polyacrylamide gel (Novagen) (Figure 9). 
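
Column calibration with protein standards is commonly done by fitting log10 of molecular weight against elution volume and reading unknowns off the fitted line. The sketch below shows that calculation. The elution volumes are invented placeholder values, since none are given in the text, and the function names are hypothetical. Only the standard masses come from the description above.

```python
import math

# Masses of the standards (Da), as listed in the text.
STANDARDS_DA = {
    "thyroglobulin": 670000.0,
    "gamma globulin": 158000.0,
    "ovalbumin": 44000.0,
    "myoglobin": 17000.0,
    "B-12": 1350.0,
}

# HYPOTHETICAL elution volumes (ml); placeholders for illustration only.
ELUTION_ML = {
    "thyroglobulin": 10.0,
    "gamma globulin": 13.0,
    "ovalbumin": 15.5,
    "myoglobin": 17.5,
    "B-12": 22.5,
}


def fit_calibration(volumes, masses):
    """Least-squares fit of log10(mass) = slope * volume + intercept."""
    xs = list(volumes)
    ys = [math.log10(m) for m in masses]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_mass(volume_ml, slope, intercept):
    """Estimated molecular weight (Da) at a given elution volume."""
    return 10 ** (slope * volume_ml + intercept)


names = list(STANDARDS_DA)
slope, intercept = fit_calibration([ELUTION_ML[n] for n in names],
                                   [STANDARDS_DA[n] for n in names])
```
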

The Holliday Junction Analysis Using Radiolabeled Templates 

The Holliday Junction analysis was performed using radiolabeled, synthetic, 
asymmetrical Holliday Junction templates, as described in Hiom, K. and S. C. West 
1995 Cell 80:787-793. The Holliday Junction templates were produced by annealing 
together four separate, single-stranded, oligonucleotide strands to form four-stranded 
structures (e.g., the Holliday Junction template). The Holliday Junction templates were 
reacted with the 2CFE 39 and 2CFE 21 polypeptides, in a helicase reaction, to test their 
ability to generate two duplex structures. 

15 

Producing the Synthetic Hollidav Junction Templates 

The asymmetrical Holliday Junction templates were produced by annealing the following 
oligonucleotide sequences: 

20 

Oligonucleotide strand 1 : 

5'-CCAGTGATCACATACGCTTTGCTAGGACATCTTGATATCAGCCCACGTT 
CACCCGCCTACCAGTGCCACGTTGTATGCCCACGTTGACC-3 5 (SEQ ID NO:438) 

25 Oligonucleotide strand 2: 

5'-GGGTCAACGTGGGCATACAACGTGGCACTGGTAGGCGGGTGAACGTGGG 
CTGATATCAAGATGTCCATCTGTCCGTTCATCTATGACGT-3' (SEQ ID NO:439) 

Oligonucleotide strand 3: 

5'-AACGTCATAGATGAACGGACAGATCATGGTGCTTTTAAAGTCTAGAGAC 
TATCGAGCATTAGTACCAGTATCGAATCCGTCTTGTCAA-3' (SEQ ID NO:440) 

Oligonucleotide strand 4: 

5'-TTTGACAAGACGGATTCGATACTGGTACTAATGCTCGATAGTCTCTAGAC 
TTTAAAAGCACCATGTAGCAAAGCGTATGTGATCACTG-3' (SEQ ID NO:441) 
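
Because the junction is formed by base pairing between the arms of neighboring strands, the design can be spot-checked by taking the reverse complement of one arm. The following is a minimal sketch, assuming the sequences are exactly as transcribed above; it checks only the arm shared by the 3' end of strand 4 and the 5' end of strand 1, and the function and variable names are illustrative rather than part of the described method.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


# Fragments copied from the strands listed above (assumed free of
# transcription errors): the first 24 nt of strand 1 and the last 23 nt
# of strand 4.
strand1_5prime = "CCAGTGATCACATACGCTTTGCTA"
strand4_3prime = "TAGCAAAGCGTATGTGATCACTG"

# If the two arms anneal, the reverse complement of the strand 4 tail
# must occur within the 5' region of strand 1.
arm = reverse_complement(strand4_3prime)
pairs_with_strand1 = arm in strand1_5prime
```

With the fragments above, the reverse complement of the strand 4 tail is found within the 5' region of strand 1, offset by one nucleotide from the 5' end. The same check could be repeated for the other three arms.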

Oligonucleotide strand 3 was labeled at the 5' end using approximately 300 ng of 
oligonucleotide strand 3, 1 µl 10x Phosphate Buffer, 5 µl 32P ATP, and 1 µl T4 polynucleotide 
kinase (Gibco-BRL), in a 10 µl volume, and the reaction was performed at 37 degrees C 
for 30 minutes. The reaction was loaded onto a G50 column to remove the 
unincorporated radiolabel. The final concentration of the radiolabeled oligonucleotide 
strand 3 was approximately 15 ng per µl. 

Approximately equimolar amounts of the four oligonucleotide strands were annealed 
(e.g., hybridized). The annealing reaction included: 5 µl Annealing Buffer (200 mM 
Tris-Cl pH 8.0, 100 mM MgCl2, 1 M NaCl, 10 mM DTT); 450 ng of radiolabeled 
oligonucleotide strand 3; and 1000 ng each of oligonucleotide strands 1, 2, and 4; in 50 µl 
total reaction volume. The control annealing reaction included: 5 µl Annealing Buffer; 
60 ng radiolabeled oligonucleotide strand 3; 1000 ng oligonucleotide strand 4; in 50 µl 
total reaction volume. Annealing was performed at 95 degrees C for 5 minutes, 65 
degrees C for 30 minutes, 42 degrees C for 30 minutes, and room temperature (e.g., 
between about 23 to 27 degrees C) for 30 minutes to generate the synthetic Holliday 
Junction templates. The synthetic Holliday Junction templates were gel or column- 
purified to remove the duplex and non-annealed products. As a control, oligonucleotide 
strands 3 and 4 were annealed to form duplex structures. The synthetic Holliday Junction 
templates and duplex structures were stored at -20 degrees C. 
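
The mass amounts given for the annealing reaction can be converted to molar amounts for comparison. The following is a rough sketch, not part of the described method: it assumes an average mass of about 330 g/mol per nucleotide of single-stranded DNA (a common rule of thumb) and a strand length of about 89 nucleotides, which is an estimate taken from the sequences as transcribed above.

```python
# Rule-of-thumb average mass per nucleotide of single-stranded DNA.
# This is an assumption for estimation, not a value from the text.
AVG_NT_MASS_G_PER_MOL = 330.0


def ng_to_pmol(mass_ng: float, length_nt: int,
               nt_mass: float = AVG_NT_MASS_G_PER_MOL) -> float:
    """Convert a mass of single-stranded oligonucleotide (ng) to pmol."""
    molecular_weight = length_nt * nt_mass   # g/mol
    nmol = mass_ng / molecular_weight        # ng / (g/mol) = nmol
    return nmol * 1000.0                     # nmol -> pmol


ASSUMED_LENGTH_NT = 89  # approximate strand length (assumption)

unlabeled_pmol = ng_to_pmol(1000, ASSUMED_LENGTH_NT)  # strands 1, 2 and 4
labeled_pmol = ng_to_pmol(450, ASSUMED_LENGTH_NT)     # radiolabeled strand 3
```

Under these assumptions the unlabeled strands are each present at roughly 34 pmol and the radiolabeled strand 3 at roughly 15 pmol, so the labeled strand is the limiting component of the annealing reaction.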

CFE 39 and CFE 21: The Helicase Reaction Using Radiolabeled Templates 

The helicase reaction was performed to determine whether 2CFE 39 and 2CFE 21 
resolved the synthetic Holliday Junction templates into duplex structures. The helicase 
reaction was performed as follows. A 50 µl total reaction volume included: 25 µl of 2x 
Reaction Buffer (50 mM Tris-Cl pH 8.0, 30 mM MgCl2, 2 mM ATP); 1 µl synthetic 
Holliday Junction template (36 ng); 2 µl 2CFE 39 (1 µM); and 2 µl 2CFE 21 (1 µM). 
The reaction was incubated at 37 degrees C for 30 minutes. The reaction was stopped by 
adding 5 µl Stop Buffer (100 mM Tris-Cl pH 7.5, 5 mg/ml Proteinase-K, 5% SDS). The 
stopped reaction was returned to 37 degrees C for 5 minutes. The helicase reaction was 
loaded onto and run on a non-denaturing, 12% PAGE, Tris-glycine gel. 

The results shown in Figure 10, lanes 6, 7 and 8, indicate that the 2CFE 39 and 2CFE 21 
polypeptides resolved the synthetic Holliday Junction templates into duplex structures. 


CFE 39: The Helicase Reaction 

It has been previously shown that E. coli RuvA binds to Holliday Junction templates 
(Parsons, C. A., et al., 1992 Proc. Natl. Acad. Sci. USA 89:5452-5456). The ability of S. 
pneumoniae CFE 39 to bind to a Holliday Junction template can be tested by employing 
the helicase assay described herein. The results of the helicase assay can be monitored by 
performing a gel shift assay and/or capillary electrophoresis. The presence of a Holliday 
Junction template bound to 2CFE 39, which migrates more slowly than the Holliday 
Junction template alone, would indicate that S. pneumoniae 2CFE 39 binds to Holliday 
Junction templates. 

CFE 39 and CFE 21: Holliday Junction Analysis Using Fluorescent-Labeled Templates 

The helicase reaction described herein was performed using Holliday Junction templates 
having one oligonucleotide strand labeled with a fluorescent agent and another strand 
labeled with a quenching agent. The 5' fluorescent end and the 3' quenching end of the 
strands that make up the Holliday Junction templates are in proximity to each other, 
resulting in a non-fluorescent template. When the Holliday Junction templates are 
resolved into duplex structures, the fluorescent and quenching ends are no longer in 
proximity to each other, resulting in fluorescence. 

The Holliday Junction templates used to perform this experiment comprised the 
following: the 5' end of oligonucleotide strand 1 was labeled with fluorescein (e.g., the 
fluorescent agent), and the 3' end of oligonucleotide strand 4 was labeled with DABCYL 
(e.g., the quenching agent). The oligonucleotide strand 1 labeled with fluorescein and the 
oligonucleotide strand 4 labeled with DABCYL were custom synthesized (Gibco-BRL 
Life Technologies, Inc.). 

The fluorescein- and DABCYL-labeled oligonucleotides were annealed in a reaction, as 
described above, to generate synthetic Holliday Junction templates. The helicase reaction 
was performed as described above. The results of the helicase reaction were monitored 
by measuring the unquenching of the Holliday Junction templates with time (Figure 11). 

The helicase assay using Holliday Junction templates labeled with fluorescent-quenching 
agents can be adapted for use in high throughput analyses to test 2CFE 39, 2CFE 21, and 
other polypeptides for their ability to resolve the templates into duplex structures. 

EXAMPLE 12 

The following provides a description of the methods used to characterize purified CFE 8 
polypeptide, which lacks a histidine tag. The CFE 8 is a putative DNA single-stranded 
binding protein. 

Computer-Aided Comparison 

The computer-aided comparison, as described in Example 9 supra, suggests that the CFE 
8 polypeptide (SEQ ID NO:121) may be a single-strand binding protein homologue, such 
as SSB. 

The 2CFE 8 polypeptide (SEQ ID NO:339) was characterized by size exclusion 
chromatography, using the Biosil SEC-125 HPLC Gel Filtration column as described in 
Example 8 supra. The chromatogram showed one peak corresponding to a molecular 
weight of approximately 89 kDa. Based on the nucleotide sequence, the predicted 
molecular weight of 2CFE 8 is 17,351 Da. In non-denaturing conditions, 2CFE 8 forms a 
multimer. 
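
The multimer conclusion follows from comparing the observed and predicted masses. The following short sketch shows the arithmetic; the variable names are illustrative. Because apparent mass from size exclusion chromatography also depends on molecular shape, the ratio is only a rough guide to subunit number.

```python
# Values taken from the paragraph above.
observed_mass_da = 89_000           # apparent mass of the single SEC peak
predicted_monomer_mass_da = 17_351  # mass predicted from the sequence

# Apparent number of monomer equivalents in the eluting species.
# Treat this as an estimate: SEC elution depends on shape as well as mass.
monomer_equivalents = observed_mass_da / predicted_monomer_mass_da
```

The ratio is approximately 5.1 monomer equivalents, which is well above 1 and therefore consistent with 2CFE 8 eluting as a multimer rather than as a monomer.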

Binding Reaction 

The 2CFE 8 polypeptide was reacted with a single-stranded oligonucleotide A. Briefly, 
the binding reaction included: 50 µM of 2CFE 8 polypeptide, 50 µM oligo strand A, 20 
mM Tris/20 mM KCl pH 7.5. The binding reaction was performed at 37 degrees C, for 2 
hours. 

Oligonucleotide strand A: 

5'-TTAGGGCCCGGGCTATCTTACAATCTCGTT-3' (SEQ ID NO:442) 

Capillary Electrophoresis 

The results of the binding reaction were monitored by capillary electrophoresis, following 
the methods described in "Handbook of Capillary Electrophoresis" 2nd Edition, 1997, ed. 
J. Landers. 

Separation was performed using an uncoated capillary tube (360 µm o.d., 50 µm i.d., 
with a 50 cm effective separation length; Watrex International, Inc., Pittsford, NY) and 
50 mM borate pH 9.3 as the mobile phase, at 25 kV, with a 20 minute separation time. 

The results indicate that 2CFE 8 alone elutes as a sharp peak, indicating little adsorption 
to the uncoated capillary wall (Figure 12 A). The shape of the peak and peak retention 
time changed with 2CFE 8 in the presence of all oligonucleotides tested (Figure 12 B). 
As a negative control, MurB polypeptide (Pucci, M. J., L. F. Discotto, and T. J. 
Dougherty 1992 "Cloning and Identification of the Escherichia coli murB DNA 
sequence, which encodes UDP-N-acetylenolpyruvoylglucosamine reductase" J. 
Bacteriol. 174:1690-1693) was reacted with the same oligonucleotides. MurB reacted 
with or without the oligonucleotides showed no change in peak shape or retention time. 

After capillary electrophoresis analyses, the 2CFE 8 alone and 2CFE 8 plus oligonucleotide 
samples were run on native polyacrylamide gels to determine whether the polypeptide 
was intact. The results indicate that in all cases, 2CFE 8 was intact and had not degraded 
with time or storage. 

Mobility Shift Assays 

The ability of 2CFE 8 polypeptide to bind oligonucleotide strand A was tested in a 
mobility shift assay. 

The results indicate that 2CFE 8 binds single-stranded oligonucleotides (Figure 13 A and 
B). In Figure 13 A, the gel was stained with ethidium bromide. The unbound 
oligonucleotides appear near the bottom of the gel, while the bound oligonucleotides 
appear near the middle. The same gel was stained with Coomassie (Figure 13 B), 
revealing that 2CFE 8 polypeptide bound to the oligonucleotide migrated further than 
unbound 2CFE 8, due to the change in charge carried by the oligonucleotide. Various 
ratios of 2CFE 8:oligo were tested. The optimal binding ratio was 2:1. 

The Effect of MgCl2 

The 2CFE 8 polypeptide precipitated in the presence of 5 mM MgCl2. The precipitation 
was reversible by the addition of 1 µM of the oligonucleotides tested. The observation 
indicates specific binding between 2CFE 8 polypeptide and the oligonucleotides tested. 

Scintillation proximity assay (SPA) methods can be used in a high throughput screening 
procedure to monitor, for example, a binding reaction. SPA utilizes beads (Amersham) 
which are coated on the surface with a particular compound or molecule. For example, 
the SPA bead may be coated with avidin to facilitate binding with any molecule having a 
biotin tag. 

The binding reaction of the 2CFE 8 polypeptide and the oligonucleotide strand A can be 
monitored using SPA beads and a scintillation counter. The beads can be coated with 
avidin, the 2CFE 8 polypeptide can be tagged with biotin, and the oligonucleotide strand 
A can be radiolabeled. 

EXAMPLE 13 

The following provides a description of the methods used to characterize purified, 2CFE 
3 (SEQ ID NO:334) and 2CFE 86 (SEQ ID NO:409) polypeptides. 

The 2CFE 3 polypeptide catalyzes the conversion of D-glucosamine-6-phosphate to D- 
glucosamine-1-phosphate, indicating that 2CFE 3 mediates amino-sugar biosynthesis 
through the N-acetyl glucosamine pathway (Figure 14). 

The 2CFE 86 polypeptide catalyzes the conversion of D-glucosamine-1-phosphate to N- 
acetylglucosamine-1-phosphate, and the conversion of N-acetylglucosamine-1-phosphate 
to UDP-N-acetylglucosamine-1-phosphate, which indicates that 2CFE 86 also mediates 
amino-sugar biosynthesis through the N-acetyl glucosamine pathway (Figure 14). 

Computer-Aided Comparisons Of CFE 3 

The computer-aided comparison, as described in Example 9 supra, suggested that the 
CFE 3 polypeptide (SEQ ID NO:116) is a phosphoglucosamine mutase, such as GlmM. 

The 2CFE 3 polypeptide was produced using the large scale IPTG-induced method 
described in Example 5, supra. The 2CFE 3 polypeptide lacks a C-terminal histidine tag. 
The 2CFE 3 polypeptide was purified using a 2-column procedure. The 2CFE 3 
polypeptide preparation was eluted from a 26/10 Q Sepharose column (Pharmacia) using 
a 0-1.0 M NaCl gradient, 2 ml/minute flow rate, and the gradient size was 1 liter. Then 
the 2CFE 3 polypeptide was eluted from a hydroxyapatite Bio-gel column (Bio-Rad) 
using a 5-200 mM potassium phosphate (pH 8.0) gradient, the flow rate was 0.3 
ml/minute, and the gradient size was 300 ml. A sample of the 2CFE 3 preparation was 
electrophoresed on an SDS polyacrylamide gel (Figure 4). 

Affinity Capillary Electrophoresis of CFE 3 

Affinity capillary electrophoresis methods were used to determine whether the 2CFE 3 
polypeptide binds to various glucose derivatives. Binding was performed under 
equilibrium conditions, in which the sugars were dissolved in the running buffer and 
reacted with 2CFE 3 during separation in the column. The affinity capillary 
electrophoresis method used to analyze 2CFE 3 follows the methods described in 
"Handbook of Capillary Electrophoresis" 2nd Edition, 1997, ed. J. Landers. 

Briefly, 2CFE 3 polypeptide was reacted with increasing amounts of various glucose 
derivatives (e.g., substrate) at 25, 30 and 37 degrees C. The glucose derivatives included 
UDP-glucose, glucose-1-phosphate, glucose-6-phosphate, glucosamine-1-phosphate, and 
glucosamine-6-phosphate. The reaction included: 2CFE 3 polypeptide (2.0 mg/ml), 
separation buffer (25 mM Tris; 192 mM Glycine, pH 8.0; BupH Tris-Glycine Buffer 
Packs, Pierce). Separation was performed at 25 kV, and the separation time was 15 or 20 
minutes. 

The results shown in Figure 15 A indicate that at 25 degrees C, 2CFE 3 binds to D- 
glucose-1-phosphate in a dose-dependent manner, as the peak shape and/or the retention 
time for 2CFE 3 changes in the presence of 100 and 500 µM D-glucose-1-phosphate 
compared to unreacted 2CFE 3. 

The results shown in Figure 15 B indicate that at 25 degrees C, 2CFE 3 binds to D- 
glucosamine-6-phosphate in a dose-dependent manner, as the peak shape and/or the 
retention time for 2CFE 3 changes in the presence of 100 and 500 µM D-glucosamine-6- 
phosphate compared to unreacted 2CFE 3. 

The results shown in Figure 15 C indicate that at 25 degrees C, the 2CFE 3 polypeptide 
also binds to glucose-6-phosphate. 

A comparison of 2CFE 3 reacted with various glucose derivatives, at 30 degrees C, is 
shown in Figure 15 D. The results indicate that D-glucosamine-6-phosphate is a putative 
substrate for 2CFE 3, as this reaction exhibits the greatest change in peak shape and/or 
retention time. 

CFE 3: Capillary Electrophoresis and Laser-Induced Fluorescence 

In a further analysis of 2CFE 3 polypeptide, capillary electrophoresis was performed with 
laser-induced fluorescence in order to separate and detect interaction between the 
substrate (e.g., D-glucosamine-6-phosphate) and the product (e.g., D-glucosamine-1- 
phosphate) in a one dose, one time-point procedure. 

The 2CFE 3 polypeptide was derivatized by reacting 10 mM FITC (fluorescein 
isothiocyanate dissolved in methanol; Calbiochem, San Diego, CA) with D-glucosamine- 
6-phosphate, at ambient temperature, in the dark, overnight. The FITC-derivatized 2CFE 
3 polypeptide (2.0 mg/ml) was reacted with the substrate (D-glucosamine-6-phosphate 
and D-glucosamine-1-phosphate) for one hour. 

Separation was performed using an uncoated capillary (360 µm o.d., 50 µm i.d., with a 
50 cm effective separation length) and 50 mM borate (pH 9.3) as the mobile phase. The 
argon-ion laser had an excitation wavelength of 488 nm and an emission filter of 520 nm 
(Beckman, Fullerton, CA). The results shown in Figure 16 indicate that 2CFE 3 binds 
and catalyzes the conversion of D-glucosamine-6-phosphate to D-glucosamine-1- 
phosphate. 

Computer-Aided Comparison Of CFE 86 

The comparison results, as described in Example 9 supra, suggested that the CFE 86 
polypeptide (SEQ ID NO:195) is an acetyltransferase, such as GlmU, which is a 
bifunctional enzyme in E. coli. It has been previously shown that, in E. coli, GlmU is a 
bifunctional protein having both the acetyltransferase and uridylyltransferase active sites 
(Mengin-Lecreulx, D. and J. van Heijenoort 1994 J. Bacteriol. 176:5788-5795; Gehring, 
A. M., et al., 1996 Biochemistry 35:579-585). The bifunctional enzyme catalyzes the 
conversion of D-glucosamine-1-phosphate to N-acetylglucosamine-1-phosphate 
(acetyltransferase), and catalyzes the conversion of N-acetylglucosamine-1-phosphate to 
UDP-N-acetylglucosamine-1-phosphate (uridylyltransferase). The Km values of the 
acetyltransferase and uridylyltransferase reactions have been previously calculated 
(Mengin-Lecreulx, D. and J. van Heijenoort 1994 supra). Additionally, the crystal 
structure of GlmU from E. coli is known (Brown, K., et al., 1999 EMBO J. 18:4096- 
4107). 

Purification of the CFE 86 Polypeptide 

The 2CFE 86 polypeptide (SEQ ID NO:409) has a C-terminal histidine tag. The 2CFE 
86 polypeptide was produced using the large scale IPTG-induced method described in 
Example 5, supra. The 2CFE 86 polypeptide was purified using the Ni-NTA affinity 
column method described in Example 6, supra. The eluted 2CFE 86 polypeptide was 
dialyzed against 50 mM Tris-Cl, 100 mM NaCl, 25% glycerol, pH 8.0. Samples of the 
purified 2CFE 86 polypeptide were electrophoresed on a polyacrylamide gel (Figure 17). 

A biochemical assay was performed to determine whether 2CFE 3 and 2CFE 86 convert 
D-glucosamine-6-phosphate to UDP-N-acetylglucosamine-1-phosphate (e.g., UDPAG). 
The 2CFE 3 and 2CFE 86 polypeptides were used in a coupled reaction based on the 
assays described in Jolly, L. P., et al., 1999 Eur. J. Biochem. 262:202-210. 

A time-dependent assay and a dose-dependent assay were performed. Briefly, the assay was 
performed in 96-well plates, each well including 100 µl volume. The assay included: 1 
mM D-glucosamine-6-phosphate (Sigma); 0.7 mM D-glucosamine-1,6-diphosphate 
(Sigma); 1.2 mM acetyl-Coenzyme A (Sigma); 5 mM uridine-5'-phosphate (Sigma); 
3 mM MgCl2 (Sigma); and 50 mM Tris-Cl, pH 8.0 (Life Technologies). The reaction was 
started by adding 1 µg of 2CFE 3 and 10 ng of 2CFE 86. The reaction was performed at 
room temperature. The reaction was stopped at 0, 15, 30, and 65 minutes, by filtering out 
the 2CFE polypeptides. 

The results of the assay were monitored by HPLC (high pressure liquid chromatography) 
using an Optisil 10µ SAX column (250 x 4.6 mm), measuring at 262 nm; the mobile 
phase was 150 mM KH2PO4 (pH 3.5), at a 1.5 ml/minute flow rate. The results shown in 
Figure 18 show the time-dependent assay and indicate that HPLC detected the presence 
of UDPAG. 

CFE 86: The Uridylyltransferase Reaction 

The 2CFE 86 polypeptide was tested in a uridylyltransferase reaction, in which N-acetyl- 
D-glucosamine-1-phosphate and UTP produce UDP-N-acetylglucosamine. The 
uridylyltransferase reaction was monitored using a malachite green/inorganic 
pyrophosphatase assay (e.g., malachite green-IPPAse assay) and/or monitored using 
HPLC. The malachite green-IPPAse assay was used to measure orthophosphate 
production from digestion of the pyrophosphate liberated in the uridylyltransferase 
reaction. 

The malachite green reagent was prepared as follows. A 0.045% solution of malachite 
green (Sigma; M9636) was prepared in water. A 4.2% solution of ammonium 
molybdate (Mallinckrodt) was prepared in 4N HCl. The malachite green and ammonium 
molybdate were mixed in a 3:1 ratio, and stirred for about 20 minutes. The mixture was 
filtered, and stored at 4 degrees C. The inorganic pyrophosphatase (Sigma; 1-2267) was 
diluted to 0.1 U/µl in 50 mM Tris/3 mM MgCl2 pH 8.0, and stored at 4 degrees C. 

The uridylyltransferase reaction was performed in 96-well plates. The coupled reaction 
described herein was performed, in the presence of 2CFE 3 alone or 2CFE 3 and 2CFE 
86, and included the addition of 0.5 U/well of the diluted inorganic pyrophosphatase. The 
reaction was mixed for 5 minutes at room temperature. The reaction was stopped by the 
addition of 240 µl/well of the malachite green reagent and 30 µl/well of 34% sodium 
citrate, and the reaction was mixed. The results of the uridylyltransferase reaction were 
monitored by spectrophotometry at 660 nm. 

The results of separate uridylyltransferase reactions were monitored by HPLC, using a 
Phenosphere-NEXT C18 column (250 x 4.6 mm). The mobile phases included A and B 
as follows: A) methanol/10 mM potassium phosphate pH 6.5 (0:100); and B) 
methanol/10 mM potassium phosphate pH 6.5 (40:60). The mobile phases were run 
under the following conditions: 100% mobile phase A for 5 minutes; to 100% mobile 
phase B in 3 minutes; and hold 100% mobile phase B for 9 minutes. The retention time 
for the UDPAG product is approximately 5.75 to 6.0 minutes. 
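
The gradient program described above can be written as a piecewise function of run time. The following is a minimal sketch under two assumptions that are not stated in the text: the change from mobile phase A to mobile phase B is linear, and system delay volume is ignored, so the values describe the programmed composition at the pump rather than at the column. The function names are illustrative.

```python
def percent_b(t_min: float) -> float:
    """Programmed percentage of mobile phase B at run time t_min (minutes).

    Program: 100% A for 5 min, change to 100% B over 3 min (assumed
    linear), then hold 100% B for 9 min (17 min total).
    """
    if t_min < 0 or t_min > 17:
        raise ValueError("time is outside the 17 minute program")
    if t_min <= 5:
        return 0.0
    if t_min < 8:
        return (t_min - 5) / 3 * 100
    return 100.0


def methanol_percent(t_min: float) -> float:
    """Programmed methanol content; A is 0% and B is 40% methanol."""
    return 0.40 * percent_b(t_min)
```

For example, at 6.5 minutes the program is halfway through the change, which corresponds to 50% mobile phase B and a programmed methanol content of 20%.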

The results of three uridylyltransferase reactions, monitored by HPLC, are summarized in 
Table III below. 

TABLE III 

Purified CFE 86:      Specific Activity (nmol/min/µg): 

2CFE 86-1             3.1 
2CFE 86-2             3.4 
2CFE 86-3             3.1 
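
As a brief summary of Table III, the three specific activity values can be averaged. The sketch below uses only the tabulated values; the choice of mean and sample standard deviation as summary statistics is an illustration and is not part of the described method.

```python
from statistics import mean, stdev

# Specific activities from Table III, in nmol/min/µg.
specific_activity = {
    "2CFE 86-1": 3.1,
    "2CFE 86-2": 3.4,
    "2CFE 86-3": 3.1,
}

values = list(specific_activity.values())
average = mean(values)   # arithmetic mean of the three preparations
spread = stdev(values)   # sample standard deviation (n - 1 denominator)
```

The three preparations average 3.2 nmol/min/µg with a sample standard deviation of about 0.17 nmol/min/µg.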



The results of the uridylyltransferase reactions, monitored by HPLC or HPLC and 
Malachite Green IPPAse assays are summarized in Table IV below. 

TABLE IV 

Reaction:                         Km (µM):     Method: 

Acetyltransferase reaction: 
  Glucosamine-1-P                 94           HPLC 
  Acetyl-CoA                      150          HPLC 

Uridylyltransferase reaction: 
  N-acetylglucosamine-1-P         48           HPLC and MG/IPPAse 
  UTP                             79           HPLC 
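
The Km values in Table IV can be related to substrate concentration through the Michaelis-Menten equation, v/Vmax = [S]/(Km + [S]). The following sketch assumes simple single-substrate Michaelis-Menten behavior with the second substrate saturating, which is a simplification for a two-substrate enzyme. It uses the acetyl-CoA concentration from the coupled assay described above (1.2 mM); the function name is illustrative.

```python
def fractional_velocity(substrate_uM: float, km_uM: float) -> float:
    """Fraction of Vmax at a given substrate concentration.

    Assumes simple Michaelis-Menten kinetics (an assumption; the enzyme
    uses two substrates per reaction).
    """
    return substrate_uM / (km_uM + substrate_uM)


KM_ACETYL_COA_UM = 150.0       # Table IV
ACETYL_COA_IN_ASSAY_UM = 1200  # 1.2 mM in the coupled assay

fraction_at_assay = fractional_velocity(ACETYL_COA_IN_ASSAY_UM, KM_ACETYL_COA_UM)
fraction_at_km = fractional_velocity(KM_ACETYL_COA_UM, KM_ACETYL_COA_UM)
```

Under this simplification, 1.2 mM acetyl-CoA supports about 89% of the maximal acetyltransferase rate, and by definition the rate is half-maximal when the substrate concentration equals Km.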



EXAMPLE 14 



The following provides a description of the methods used to characterize various 2CFE 
polypeptides, including CFE 21, 34, 35, 39, and 90. The molecular weights of these 2CFE 
polypeptides were analyzed by size exclusion chromatography and gel electrophoresis. 
The 2CFE 34, 35, and 90 polypeptides putatively mediate fatty acid biosynthesis. 

Computer-Aided Comparison 

The computer-aided comparison, as described in Example 9 supra, suggests that CFE 34 
(SEQ ID NO:143), CFE 35 (SEQ ID NO:144), and CFE 90 (SEQ ID NO:199) are 
polypeptides which mediate a fatty acid biosynthesis pathway (Figure 19). 

The comparison suggests that CFE 34 is a malonyl CoA:ACP transacylase, which 
catalyzes the reaction in which malonyl CoA and acyl carrier protein (ACP) are 
converted to malonyl-ACP and CoA. Thus, the CFE 34 polypeptide may be a homologue 
of E. coli FabD. 

The comparison suggests that CFE 90 is a 3-oxoacyl-ACP synthase II (beta ketoacyl- 
ACP synthase II) which catalyzes the reaction in which malonyl-ACP is converted to 
beta aceto acetyl-ACP. Thus, the CFE 90 polypeptide may be a homologue of E. coli 
FabF. 

The comparison suggests that CFE 35 is a 3-oxoacyl-ACP reductase (beta aceto acetyl 
ACP reductase) which catalyzes the reaction in which beta-keto-acetyl-ACP is converted 
to beta-hydroxy-acetyl-ACP. Thus, the CFE 35 polypeptide may be a homologue of E. 
coli FabG. 

Size Exclusion Chromatography 

The estimated molecular weights of 2CFE 34 (SEQ ID NO:361), 2CFE 35 (SEQ ID 
NO:362), and 2CFE 90 (SEQ ID NO:413) were determined using the Biosil SEC-125 
HPLC Gel Filtration column as described in Example 8, supra. 

The results suggest that 2CFE 34 polypeptide is a monomeric protein (33,093 Da), 2CFE 
35 is a trimeric protein (25,758 Da; approximately 85%), and 2CFE 90 is a dimeric 
protein (43,930 Da). Selected eluted samples of 2CFE 34 were electrophoresed on a 
polyacrylamide gel (Figure 20). 

Biochemical Assay: CFE 34 

The function of 2CFE 34 was determined by performing various biochemical reactions. 
To determine whether 2CFE 34 catalyzes the conversion of malonyl-CoA to malonyl-ACP and 
CoA, the following reaction was performed. 

The biochemical reaction was performed in the presence of acyl carrier protein. The 
reaction included the following: 10 µM 14C-labeled malonyl-CoA, 20 µM ACP, 30 µM 
2CFE 34 (e.g., FabD) in 20 mM Tris-Cl, pH 8.0 and 5 mM DTT in 300 µl volume. The 
reaction was performed at room temperature (e.g., approximately 24 degrees C) for 30 
minutes. The reaction was terminated with the addition of 45 µl of 0.5% TFA. The 
labeled reaction was injected onto a MonoQ 5/5 column on a Gilson HPLC. Detection 
was performed by monitoring the radioactivity of the continuous flow-through of the 
HPLC effluent. Chromatography was performed using a buffer gradient for column 
elution. Buffer A included 20 mM Tris-Cl, pH 8.3. Buffer B was the same as Buffer A 
and included 1 M NaCl. The program was held at 90% A, 10% B for 10 minutes 
followed by a linear ramp to a final mix of 50% of each Buffer A and B over 10 minutes. 

The substrate (e.g., 14C malonyl-CoA) eluted at 9.9 minutes, and the product (e.g., 14C 
malonyl-ACP) eluted at 14.3 minutes. The results indicate that CFE 34 catalyzes the 
conversion of malonyl-CoA and acyl carrier protein (ACP) to malonyl-ACP and CoA. 
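
The elution times can be related to the programmed salt concentration of the MonoQ gradient. The following is a sketch under stated assumptions: Buffer B contributes 1 M NaCl and Buffer A none, and the delay between the pump and the detector is ignored, so the values are the programmed concentrations at the time of elution and not necessarily the concentration at which each species actually left the column. The function names are illustrative.

```python
def percent_buffer_b(t_min: float) -> float:
    """Programmed percentage of Buffer B at run time t_min (minutes).

    Program: hold at 10% B for 10 min, then linear ramp to 50% B over
    the next 10 min.
    """
    if t_min < 0 or t_min > 20:
        raise ValueError("time is outside the 20 minute program")
    if t_min <= 10:
        return 10.0
    return 10.0 + (t_min - 10) / 10 * 40


def nacl_mM(t_min: float) -> float:
    """Programmed NaCl concentration; Buffer B contains 1 M (1000 mM) NaCl."""
    return percent_buffer_b(t_min) / 100 * 1000


substrate_nacl = nacl_mM(9.9)   # 14C malonyl-CoA elution time
product_nacl = nacl_mM(14.3)    # 14C malonyl-ACP elution time
```

On this basis the substrate elutes during the 100 mM NaCl hold, and the product elutes when the programmed NaCl concentration has reached about 272 mM.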

EXAMPLE 15 

The following provides a description of the methods used to characterize CFE 
polypeptides 40, 41, and 46. 

Computer-Aided Comparison 

The computer-aided comparison, as described in Example 9 supra, suggests that the CFE 
40 polypeptide (SEQ ID NO:149) is a phosphomethylpyrimidine (HMP-P) kinase 
involved in thiamine biosynthesis. 

The comparison, as described in Example 9 supra, suggests that the CFE 41 polypeptide 
(SEQ ID NO:150) has a GTP-binding motif and may be a protease. 

The comparison, as described in Example 9 supra, suggests that the CFE 46 polypeptide 
(SEQ ID NO:155) has an ATP-binding motif. 

Affinity Purification of CFE 41 

The large-scale method described in Example 5 supra (e.g., IPTG-induced protein 
production) was used to prepare a sample of 2CFE 41 polypeptide (SEQ ID NO:368). 
The sample was affinity purified using the Ni-NTA method described in Example 6, 
supra. The eluted fractions were loaded onto and run on a 12% SDS-PAGE gel (Novex) 
(Figure 21). 

Circular Dichroism and Circular Dichroism Thermal Melt Analysis 

Circular dichroism and circular dichroism thermal melt methods were performed using 
JASCO instrumentation. The concentration of the isolated 2CFE 40 (SEQ ID NO:367) 
was approximately 21 µM, in a 0.1 cm pathlength cell at 210 nm. The circular 
dichroism spectrum suggests that this preparation of 2CFE 40 had mixed alpha and beta 
secondary structure. The circular dichroism thermal melt spectrum suggests that 2CFE 
40 has a Tm of approximately 67 degrees C. The 2CFE 40 polypeptide precipitates at 
approximately the Tm. 

The concentration of the isolated 2CFE 41 (SEQ ID NO:368) was approximately 70 µM, 
in a 0.02 cm pathlength cell. The circular dichroism spectrum suggests that this 
preparation of 2CFE 41 had mixed alpha and beta secondary structure, with a greater 
percentage of alpha structures. The circular dichroism thermal melt spectrum suggests 
that 2CFE 41 has a Tm of approximately 38 degrees C. The 2CFE 41 polypeptide 
precipitates at approximately the Tm. 

The concentration of the isolated 2CFE 46 (SEQ ID NO:373) was approximately 23 µM, 
in a 0.1 cm pathlength cell at 280 nm. The circular dichroism spectrum suggests that this 
preparation of 2CFE 46 had mixed alpha and beta secondary structure. The circular 
dichroism thermal melt spectrum suggests that 2CFE 46 is highly stable at elevated 
temperatures. At 90 degrees C, the 2CFE 46 polypeptide exhibited only a 27% loss in 
signal and the polypeptide remained soluble. 


Capillary Electrophoresis 

Capillary electrophoresis was performed on samples of purified 2CFE 40, 41 and 46. 
The electropherograms of 2CFE 40, 41, and 46 are shown in Figure 22. 


EXAMPLE 16 

The following provides a description of methods that can be used to characterize CEG 
polypeptides (e.g., CFE polypeptides). 

Computer-Aided Compilation 

Computer-aided compilation of bacterial metabolic pathways may be analyzed using 
Pathway Tools from Doubletwist, based on the EcoCyc system (Karp P.D., et al., 1999 
Nucleic Acids Res. 27(1):55-58). This analysis may be used to predict which CFEs 
mediate various steps of the pathways. This information may be used in combination 
with the results of a binding reaction which identifies a ligand or substrate that binds with 
a CFE polypeptide. 



Identifying the Function of a CFE Polypeptide 

The function of a CFE polypeptide may be identified by identifying a ligand or substrate 
which binds with the CFE polypeptide. The ligand or substrate may be identified using 
fractionation and affinity capillary electrophoresis methods. The following method is 
based upon the assumption that the bacterial cell lysate includes the ligand or substrate. 

A bacterial host cell carrying an endogenous (e.g., native) CFE gene or carrying a 
recombinant vector which includes a CFE gene may be cultured so that the CFE 
polypeptide is produced by the cell. The cells may be ruptured in order to obtain the cell 
lysate. The cell lysate may be fractionated using HPLC technology. The HPLC fractions 
may be reacted with a CFE polypeptide in a binding reaction, and the binding reaction 
may be analyzed by affinity capillary electrophoresis methods. The ligand or substrate 
which reacts with the CFE polypeptide may be identified using mass spectrometry 
methods ("Mass Spectrometry" 1990 ed. McCloskey, J. A., in Methods in 
Enzymology volume 193; Henion, J., et al., 1993 "Mass Spectrometric Investigations of 
Drug-Receptor Interactions" Ther. Drug Monit. 15:563-569; Loo, J. A., et al., 1999 
"Application of Mass Spectrometry for Target Identification and Characterization" Med. 
Res. Rev. 19:307-319; Nguyen, D. N., et al., 1995 "Protein Mass Spectrometry: 
Applications to Analytical Biotechnology" J. Chromatogr. 705:21-45). 

EXAMPLE 17 

The following provides a description of nuclear magnetic resonance (NMR) spectroscopy 
methods that were used to characterize CFE polypeptides. 

High resolution NMR spectroscopy was applied to 15N-labeled, 13C/15N-labeled, 
2H/13C/15N-labeled, and type-specifically isotopically labeled CFE polypeptide samples 
in the solution state for the following purposes: to assess various aspects of the structural 
state, e.g., foldedness, structural integrity; to refine a previously determined experimental 
structure of a close sequence homologue; to refine a homology-modeled structure; to 
assess the potential for a CFE polypeptide to bind small molecules; and to identify small- 
molecule pharmacophoric fragments that bind specifically to the CFE polypeptide 
("Nuclear Magnetic Resonance" 1994 ed. James, T. L. in Methods in Enzymology 
volume 239). 

The NMR analysis includes screening two compound decks against the CFE polypeptides 
(i.e., target polypeptides): a deck of approximately 4,500 commercially available, 
structurally and chemically diverse compounds (the small-molecule pharmacophore deck) 
and a deck of proprietary, known, anti-microbial compounds (anti-microbial deck). 
Binding is detected either from perturbations to the chemical shifts of the amide proton 
and/or nitrogen resonances, as measured from a two-dimensional proton-nitrogen 
heteronuclear single-quantum correlation spectrum (2D screening method), or from 
increases in the linewidth of the compound's proton resonance(s), as measured by a 
one-dimensional T1ρ spin-lock difference spectrum (1D screening method). Both methods 
indicate whether a compound binds to a CFE polypeptide; the 2D screening method also 
indicates where the compound binds on the CFE polypeptide. 

Isotopic Labeling of CFE Polypeptides 

BL21-DE3 E. coli bacteria are transformed with the CFE expression vectors. Expression 
takes place between 20°C and 37°C in minimal media containing [15N]-ammonium 
sulfate as the sole nitrogen source and either glucose, [2H]i3-glucose, or [13C]6-glucose as 
the sole carbon source. Glucose is used for preparing uniformly 15N-labeled and 2H/15N- 
labeled CFE polypeptides. [2H]i3-glucose is used for preparing type-specifically 1H/13C- 
labeled, uniformly 15N-labeled CFE polypeptides. [13C]6-glucose is used for preparing 
13C/15N-labeled CFE polypeptides. The minimal media is prepared in 100% H2O for 
expressing both uniformly 15N-labeled and uniformly 13C/15N-labeled CFE polypeptides; 
the minimal media is prepared in 95% D2O (deuterium oxide) and 5% H2O for expressing 
both type-specifically 1H/13C-labeled, uniformly 15N-labeled, and just uniformly 2H/15N- 
labeled CFE polypeptides. In the case of type-specifically 1H/13C-labeled, uniformly 15N- 
labeled CFE polypeptides, 40 mg/L of protonated and uniformly 13C/15N-labeled 
isoleucine, valine, and leucine amino acids are added to the minimal media. 


NMR Screening 

Compounds in the anti-microbial deck are pre-dissolved to a target concentration of 16 
mM in deuterated DMSO (dimethylsulfoxide) with each deck well containing only one 
compound. Compounds in the small-molecule pharmacophore deck are pre-dissolved in 
deuterated DMSO to a target concentration of 50 mM in groups of 8, i.e., each deck well 
contains 8 unique compounds with each compound at a target concentration of 50 mM. 

3.5 µl of compound is placed at the bottom of a well in a 96-well screening plate. This 
well will be referred to as the compound screening well. Each compound screening well 
contains solution from only one deck well. 166.5 µl of buffer is added to each compound 
screening well. 170 µl of a CFE polypeptide solution, initially at a concentration ranging 
from 200-300 µM, is added to each compound screening well; the contents of that well 
are then thoroughly mixed. The control screening well contains only 3.5 µl of deuterated 
DMSO. The screening plate is then centrifuged in a bucket rotor for 15 minutes at 3,500 
rpm to ensure that all particulate matter is at the bottom of the well. 
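
The volumes above fix the final concentrations in each compound screening well. The 
short Python sketch below works through that arithmetic; it assumes the three volumes 
are simply additive, and the function and variable names are illustrative only. 

```python
def diluted(stock_conc, added_ul, total_ul):
    """Concentration after diluting added_ul of stock into total_ul (units of stock_conc)."""
    return stock_conc * added_ul / total_ul


COMPOUND_UL = 3.5    # compound solution in deuterated DMSO
BUFFER_UL = 166.5    # buffer
PROTEIN_UL = 170.0   # CFE polypeptide solution
TOTAL_UL = COMPOUND_UL + BUFFER_UL + PROTEIN_UL  # assumes additive volumes

# Anti-microbial deck: one compound per well, 16 mM stock.
antimicrobial_mM = diluted(16.0, COMPOUND_UL, TOTAL_UL)
# Pharmacophore deck: each of the 8 compounds in a well, 50 mM stock.
pharmacophore_mM = diluted(50.0, COMPOUND_UL, TOTAL_UL)
# CFE polypeptide: 200-300 uM stock.
protein_uM = tuple(diluted(c, PROTEIN_UL, TOTAL_UL) for c in (200.0, 300.0))
# Deuterated DMSO as a percentage of the final volume.
dmso_percent = 100.0 * COMPOUND_UL / TOTAL_UL

print(f"total volume: {TOTAL_UL:.1f} ul")
print(f"anti-microbial compound: {antimicrobial_mM:.3f} mM")
print(f"pharmacophore compound (each): {pharmacophore_mM:.3f} mM")
print(f"CFE polypeptide: {protein_uM[0]:.0f}-{protein_uM[1]:.0f} uM")
print(f"DMSO: {dmso_percent:.2f} % v/v")
```

Under that assumption the polypeptide is diluted two-fold and each compound roughly 
97-fold, with deuterated DMSO making up about 1% of the sample. 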

The 2D screening method requires a single control screening well in which the compound 
solution consists only of deuterated DMSO. The 1D screening method requires a control 
screening well for each compound screening well. In the case of the 1D screening 
method, the control screening well is prepared identically to the compound screening 
well except that the 170 µl of a CFE polypeptide solution is replaced by 170 µl of buffer. 

The screening plate is covered with aluminum foil and placed onto a rack of a Gilson 
liquid handler. The Gilson liquid handler, under computer control by the NMR 
host/data-acquisition software, is responsible for removing each sample from the 
screening plate, injecting the sample into a high-resolution, 1H/15N double-resonance 
NMR flow-probe, removing the sample from the flow-probe, and dispensing it back into 
the screening plate well from which the sample was originally removed. NMR data are 
collected on the sample while the sample resides in the NMR flow-probe. The type of 
NMR data collected depends upon whether the 2D or 1D screening method is being used. 

Determining Structural Characteristics of a CFE Polypeptide 

In assessing various aspects of the structural state of a CFE polypeptide, NMR was used 
to provide the following information. The proton 1D spectra and proton-nitrogen 2D 
correlation NMR spectra were used to assess the overall foldedness of a CFE polypeptide 
without actually describing in detail that folded state. Unfolded and substantially 
misfolded proteins produced distinct signatures in these two types of NMR spectra. 

The chemical shifts of most protein nuclei in either the set {HN, Hα, Hβ, C', Cα, Cβ, N} or 
the set {HN, C', Cα, Cβ, N} for perdeuterated (e.g., 2H-labeled) proteins were determined 
by procedures well known in the art that involve collecting up to 10 triple-resonance 
NMR data sets. The protein secondary structure was delineated as either helical, turn or 
extended (e.g., β-sheet) by measuring Δ(δCα - δCβ), ΔδC', and ΔδHα, where δ refers to the 
chemical-shift value and Δ refers to the difference between chemical-shift values 
measured in this protein and those measured for the same residue type in a random-coil 
(unstructured), tetrameric peptide. 
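
The use of Δ(δCα - δCβ) can be sketched in Python as follows. Positive values of this 
difference are generally associated with helical residues and negative values with 
extended residues. The random-coil reference shifts, the 1.0 ppm cutoff, the function 
names and the observed shifts below are illustrative assumptions and are not values from 
this procedure; a real analysis would use a full reference table, the other secondary shifts, 
and runs of consecutive residues, and would apply separate criteria to assign turns. 

```python
# Approximate random-coil 13C shifts (ppm) for three residue types.
# Illustrative placeholders: substitute the reference table actually in use.
RANDOM_COIL = {
    "ALA": {"CA": 52.5, "CB": 19.0},
    "VAL": {"CA": 62.2, "CB": 32.7},
    "LEU": {"CA": 55.1, "CB": 42.3},
}


def delta_ca_cb(res_type, obs_ca, obs_cb, table=RANDOM_COIL):
    """Secondary shift of CA minus secondary shift of CB, in ppm."""
    ref = table[res_type]
    return (obs_ca - ref["CA"]) - (obs_cb - ref["CB"])


def classify(value_ppm, cutoff_ppm=1.0):
    """Label one residue from its CA/CB secondary-shift difference (assumed cutoff)."""
    if value_ppm >= cutoff_ppm:
        return "helical"
    if value_ppm <= -cutoff_ppm:
        return "extended"
    return "undetermined"


# Hypothetical observed shifts: (residue type, CA ppm, CB ppm).
observed = [("ALA", 55.0, 18.2), ("VAL", 60.5, 34.5), ("LEU", 55.3, 42.1)]
profile = [classify(delta_ca_cb(*row)) for row in observed]
print(profile)
```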

This secondary-structure profile was generated in approximately 2-3 weeks per protein. 
The secondary-structure profile was used to confirm the functional identity of a protein. 
It was also used to refine the list of possible functional identities of folds associated with 
a protein or polypeptide, as predicted by various computational techniques including 
fold recognition. 

NMR was used to generate folds of proteins or polypeptides for which no structure of a 
sequence homologue was known and no structural homologue was discernible in the 
PDB by fold recognition techniques. 

Refining a Structural Model 

Nuclear Overhauser effect (NOE) data were used to refine both homology-modeled 
structures and previously determined experimental structures of close sequence 
homologues. This process took approximately 2-3 weeks per structure. 

The CFE 88 polypeptide was characterized by NMR analysis to establish its secondary 
structure. The NMR data were used to filter the computer-aided threading analysis. The 
NMR-determined secondary structure for CFE 88 suggested that CFE 88 is structurally 
similar to 4-aminoimidazole carboxylase. 

The characteristics of other CFE polypeptides were analyzed by NMR methods. A 
computer-aided threading analysis revealed that the N-terminal domain of the protein 
EGA, which both binds and hydrolyzes GTP, was both structurally similar and 
sufficiently similar in sequence to CFE 52 to suggest that CFE 52 had a similar function. 


The NMR data for CFE 103 suggested that this polypeptide is unfolded. Circular dichroism 
spectra, as a function of temperature, also indicated that CFE 103 was unfolded. 

The CFE 2, 42, 43, 68 and 88 polypeptides were tested for their ability to bind potential 
inhibitor molecules by screening both the anti-microbial deck and the small-molecule 
pharmacophore deck. CFE 34 was tested for its ability to bind potential inhibitor 
molecules by screening the anti-microbial deck. 

NMR-based screening was used to measure binding against both the small-molecule 
pharmacophore deck and the anti-microbial deck. Binding data from these screens 
allowed assessment of the propensity of a protein to bind small molecules. The binding 
data were also used to identify sites on the protein that are capable of binding small 
molecules and to identify common pharmacophores among the compounds that bind. 

Reverse screening refers to a process whereby known anti-microbial compounds, the 
microbial target of which is unknown, are screened by a general method, e.g., binding as 
assessed by NMR, to find a physical interaction with polypeptide targets previously 
determined to be essential to the bacteria (i.e., the CFEs). The reverse screening method 
was used to determine which CFE polypeptides bind to which compounds in the 
anti-microbial deck. The reverse screening method included the following steps. The 
compounds in a proprietary compound deck were screened for Minimal Inhibitory 
Concentration (MIC). The compounds exhibiting antimicrobial activity were designated 
active compounds. The CFE polypeptides were screened to determine which polypeptides 
bind to which active compounds. The CFE polypeptides which bound to the active 
compound(s) were confirmed, where possible (i.e., in cases where an in-vitro assay could 
be constructed), as being inhibited in their function by the active compound(s), by 
examining the inhibition profile of the compound(s) against the CFE polypeptides. For 
additional confirmation, the effect of the compound on the microorganism harboring the 
CFE polypeptide was monitored (e.g., in whole cell assays). 
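
The bookkeeping behind the first two reverse-screening steps can be sketched in Python 
as below: compounds are designated active from their MIC values, and the observed 
binding pairs are then grouped by active compound. The MIC cutoff, the compound and 
polypeptide names, and every data value are hypothetical and serve only to show the 
shape of the data; they do not reproduce any result reported here. 

```python
def active_compounds(mic_by_compound, mic_cutoff):
    """Names of compounds whose MIC is at or below an assumed activity cutoff."""
    return {name for name, mic in mic_by_compound.items() if mic <= mic_cutoff}


def targets_by_compound(binding_pairs, actives):
    """Group (polypeptide, compound) binding observations by active compound."""
    targets = {name: [] for name in sorted(actives)}
    for polypeptide, compound in binding_pairs:
        if compound in targets:  # binding to inactive compounds is ignored
            targets[compound].append(polypeptide)
    return targets


# Hypothetical MIC values (arbitrary units) and binding observations.
mic = {"compound_A": 4.0, "compound_B": 64.0, "compound_C": 8.0}
pairs = [
    ("polypeptide_X", "compound_A"),
    ("polypeptide_Y", "compound_C"),
    ("polypeptide_Z", "compound_B"),
    ("polypeptide_X", "compound_C"),
]

actives = active_compounds(mic, mic_cutoff=16.0)
reverse_screen = targets_by_compound(pairs, actives)
print(reverse_screen)
```

Each polypeptide listed against an active compound would then go forward to functional 
confirmation in an in-vitro assay, where one can be constructed. 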

The structure of the active compound was used as a basis to generate chemically-related 
compounds by iterative synthesis. The chemically-related compounds were tested in a 
screening assay for binding with CFE polypeptides. The active compounds and the 
chemically-related compounds of interest were the compounds which exhibited an 
increase in binding affinity for a CFE polypeptide and/or exhibited drug-like properties. 

The results of the reverse screening are as follows. 127 compounds from the proprietary 
compound deck exhibited anti-microbial activity. 94 of these active compounds were 
selected based upon both lack of cytotoxicity and lack of excessive hydrophobicity. 
These 94 compounds were soluble to 16 mM in deuterated DMSO; these compounds 
were also deemed to be sufficiently soluble in aqueous buffer for both the 2D and 1D 
NMR screening methods. 

This subset of 94 compounds was used in an NMR-based screen to determine which 
compound binds to which CFE polypeptide. The CFE 42 polypeptide bound two 
different compounds with Kd values in the range of 0.2 to 1 mM; the CFE 43 polypeptide 
bound one compound with Kd ~ 30-50 µM; the CFE 34 polypeptide bound 13 
compounds, one of which inhibited the polypeptide function with IC50 < 10 µM. 
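
To give a feel for what such Kd values mean in a screening sample, the Python sketch 
below evaluates the standard 1:1 binding equation for the fraction of polypeptide 
occupied by compound. The 1:1 binding model, the total concentrations of 100 µM 
polypeptide and 165 µM compound, and the two example Kd values are assumptions 
chosen for illustration; they are not measurements from this work. 

```python
import math


def fraction_protein_bound(protein_total, ligand_total, kd):
    """Fraction of protein in complex for 1:1 binding; all arguments in the same units."""
    s = protein_total + ligand_total + kd
    # Physically meaningful root of the quadratic for the complex concentration.
    complex_conc = (s - math.sqrt(s * s - 4.0 * protein_total * ligand_total)) / 2.0
    return complex_conc / protein_total


PROTEIN_UM = 100.0  # assumed total polypeptide concentration
LIGAND_UM = 165.0   # assumed total compound concentration

tight = fraction_protein_bound(PROTEIN_UM, LIGAND_UM, kd=40.0)    # Kd of 40 uM
weak = fraction_protein_bound(PROTEIN_UM, LIGAND_UM, kd=1000.0)   # Kd of 1 mM

print(f"Kd 40 uM: {tight:.2f} of polypeptide bound")
print(f"Kd 1 mM:  {weak:.2f} of polypeptide bound")
```

Under these assumptions a compound with a Kd in the tens of micromolar occupies most 
of the polypeptide, while a millimolar binder occupies only a small fraction of it. 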

The enzyme assay used to confirm the NMR results, which suggested CFE 34 interaction 
with the compounds, included the following: 10 µM 14C-labeled malonyl CoA; 20 nM 
ACP; 30 pM CFE 34; 20 mM Tris-Cl, pH 8.0; 5 mM DTT; in the presence or absence of 
50 µM of a compound. The compound was solubilized at 40 mM in 100% DMSO, diluted 
100-fold into 10% DMSO, and further diluted 8-fold for a final concentration of 50 µM 
in 1.25% DMSO. The reaction was performed at room temperature and was stopped with 
the addition of TFA. Two hundred µl of the reaction was injected onto a Mono Q 5/5 
column. The chromatography buffers were: A) 20 mM Tris-Cl, pH 8.3; B) 20 
mM Tris-Cl, pH 8.3, 1 M NaCl. The gradient was: hold at 10% B for 5 minutes, linear 
gradient from 10% B to 50% B in 10 minutes, back to 10% B in 1 minute, hold for 14 
minutes to re-equilibrate. The reaction substrate (14C-malonyl CoA) eluted at 9.9 
minutes; the reaction product (14C-malonyl ACP) eluted at 14.3 minutes. 
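
The gradient program can be written as a piecewise function of time, as in the Python 
sketch below, which also converts %B to the programmed NaCl concentration. The sketch 
assumes the program starts at injection and ignores any delay volume between the pumps 
and the column, so the values are the programmed composition at each retention time 
and not a measured eluting salt concentration. The function names are illustrative. 

```python
def percent_b(t_min):
    """Programmed %B at time t_min (minutes) for the 30-minute gradient program."""
    if not 0.0 <= t_min <= 30.0:
        raise ValueError("time outside the 30-minute program")
    if t_min <= 5.0:    # hold at 10% B
        return 10.0
    if t_min <= 15.0:   # linear ramp from 10% B to 50% B over 10 minutes
        return 10.0 + 40.0 * (t_min - 5.0) / 10.0
    if t_min <= 16.0:   # return to 10% B over 1 minute
        return 50.0 - 40.0 * (t_min - 15.0)
    return 10.0         # re-equilibration hold for 14 minutes


def nacl_molar(t_min, nacl_in_b_molar=1.0):
    """Programmed NaCl concentration; buffer A has no NaCl, buffer B has 1 M."""
    return nacl_in_b_molar * percent_b(t_min) / 100.0


for label, t in (("14C-malonyl CoA", 9.9), ("14C-malonyl ACP", 14.3)):
    print(f"{label}: {percent_b(t):.1f}% B, {nacl_molar(t):.3f} M NaCl programmed")
```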

What is claimed is: 

1. An isolated nucleic acid molecule encoding a polypeptide which (1) is essential 
for the viability of a bacterial cell and (2) has at least any one of the functions of a 
pantothenate kinase, a Holliday Junction branch migration protein, a single 
stranded DNA binding protein, a phosphoglucosamine mutase, an 
acetyltransferase, a uridylyltransferase, a malonyl CoenzymeA:ACP transacylase, 
a 3-oxoacyl-ACP synthase II, a 3-oxoacyl-ACP reductase, a 
phosphomethylpyrimidine (HMP-P) kinase, a GTP binding protein, an ATP 
binding protein, or a 4-aminoimidazole carboxylase. 

2. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:97 or Figure 115 and wherein the polypeptide is a 
pantothenate kinase. 

3. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:35, Figure 60, SEQ ID NO:19, or Figure 44, and wherein 
the polypeptide is a Holliday Junction branch migration protein. 

4. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:8 or Figure 33 and wherein the polypeptide is a single 
stranded DNA binding protein. 

5. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:3 or Figure 28 and wherein the polypeptide is a 
phosphoglucosamine mutase. 

6. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:82 or Figure 103 and wherein the polypeptide is an 
acetyltransferase. 

7. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:82 or Figure 103 and wherein the polypeptide is a 
uridylyltransferase. 

8. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:30 or Figure 55 and wherein the polypeptide is a 
malonyl CoenzymeA:ACP transacylase. 

9. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:86 or Figure 107 and wherein the polypeptide is a 3- 
oxoacyl-ACP synthase II. 

10. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:31 or Figure 56 and wherein the polypeptide is a 3- 
oxoacyl-ACP reductase. 

11. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:36 or Figure 61 and wherein the polypeptide is a 
phosphomethylpyrimidine (HMP-P) kinase. 

12. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:37, Figure 62, SEQ ID NO:48, or Figure 73, and 
wherein the polypeptide is a GTP binding protein. 

13. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:42 or Figure 67 and wherein the polypeptide is an ATP 
binding protein. 

14. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:84 or Figure 105 and wherein the polypeptide is a 4- 
aminoimidazole carboxylase. 

15. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:48 or Figure 73 and wherein the polypeptide is a GTP 
binding protein. 

16. An isolated nucleic acid molecule encoding a polypeptide which is essential for 
the viability of a bacterial cell, the nucleic acid molecule comprising a sequence 
shown in any one of SEQ ID NOS:1-113. 

17. An isolated nucleic acid molecule encoding a polypeptide which is essential for 
the viability of a bacterial cell, the nucleic acid molecule comprising a sequence 
shown in any one of Figures 26-130. 

18. An isolated nucleic acid molecule encoding any one of a polypeptide designated 
CFE 1-117 having the amino acid sequence shown in SEQ ID NOS:114-226. 

19. An isolated nucleic acid molecule comprising a nucleotide sequence which is 
complementary to the nucleotide sequence of claim 1, 16, 17 or 18. 

20. The isolated nucleic acid molecule of claim 1, 16, 17 or 18 which is DNA or 
RNA. 

21. The isolated nucleic acid molecule of claim 20, which is labeled with a detectable 
marker. 

22. The isolated nucleic acid molecule of claim 21, wherein the detectable marker is 
selected from the group consisting of a radioisotope, a fluorescent compound, a 
bioluminescent compound, a chemiluminescent compound, a metal chelator and 
an enzyme. 



23. A vector comprising the nucleotide sequence of claim 1, 16, 17, or 18. 


24. A host-vector system comprising the vector of claim 23, in a suitable host cell. 

25. The host-vector system of claim 24, wherein the suitable host cell is selected from 
a group consisting of a yeast cell, a plant cell, and an animal cell. 


26. The host-vector system of claim 24, wherein the suitable host cell is selected from 
a group consisting of an Escherichia cell, a Bacillus cell, a Pseudomonas cell, a 
Streptococcus cell, and a Streptomyces cell. 

27. An isolated polypeptide which is essential for the viability of a bacterial cell 
comprising the amino acid sequence as shown in any one of SEQ ID NOS:114-226. 

28. An isolated polypeptide which is essential for the viability of a bacterial cell 
encoded by the isolated nucleic acid molecule of claim 1, 16, 17, or 18. 

29. The isolated polypeptide of claim 27 or 28 which is a fusion polypeptide. 

30. A method for producing a polypeptide having the amino acid sequence of any one 
of SEQ ID NOS:114-226 or a polypeptide encoded by the polynucleotide 
sequence as shown in any one of Figures 26-130, comprising: 

a) culturing the host-vector system of claim 24 under suitable conditions so as to 
produce the polypeptide; and 

b) recovering the polypeptide so produced. 

31. A polypeptide produced by the method of claim 30. 

32. A ligand which binds the polypeptide of claim 27 or 28. 

33. The ligand of claim 32 which is an antibody or an immunologically active 
fragment thereof. 

34. The ligand of claim 33, wherein the antibody is a monoclonal antibody. 

35. The ligand of claim 32 which is a diazalactone. 

36. The ligand of claim 35, wherein the diazalactone comprises the structure: 

37. The ligand of claim 32 which is an N-protected amino acid. 

38. The ligand of claim 37, wherein the N-protected amino acid comprises the 
structure: 



39. The ligand of claim 32 which is an azabicyclodiene. 

40. The ligand of claim 39, wherein the azabicyclodiene comprises the structure: 




41. The ligand of claim 32 which is an alkaloid. 

42. The ligand of claim 41, wherein the alkaloid comprises the structure: 





43. The ligand of claim 41, wherein the alkaloid comprises the structure: 




WO 01/49721 PCT/US00/35604 



44. The ligand of claim 41, wherein the alkaloid comprises the structure: 




45. The ligand of claim 41, wherein the alkaloid comprises the structure: 

46. The ligand of claim 41, wherein the alkaloid comprises the structure: 




47. A method for detecting the presence of the polypeptide of claim 27 or 28 in a 
sample, comprising contacting the sample with a ligand which binds the 
polypeptide and detecting the binding of the polypeptide with the ligand in the 
sample. 

48. The method of claim 47, wherein the detecting comprises: 

a) contacting the sample with the ligand; and 

b) determining whether a polypeptide-ligand complex is so formed. 

49. The method of claim 47, wherein the sample is a cell, a tissue, or a biological 
fluid. 

50. The method of claim 47, wherein the sample is blood, serum, a swab from nose, a 
swab from ear, or a swab from throat. 

51. The method of claim 47, wherein the ligand is a diazalactone. 

52. The method of claim 51, wherein the diazalactone comprises the structure: 




53. The method of claim 47, wherein the ligand is an N-protected amino acid. 

54. The method of claim 53, wherein the N-protected amino acid comprises the 
structure: 




55. The method of claim 47, wherein the ligand is an azabicyclodiene. 



56. The method of claim 55, wherein the azabicyclodiene comprises the structure: 




57. The ligand of claim 47 which is an alkaloid. 

58. The ligand of claim 57, wherein the alkaloid comprises the structure: 




59. The ligand of claim 57, wherein the alkaloid comprises the structure: 




60. The ligand of claim 57, wherein the alkaloid comprises the structure: 







61. The ligand of claim 57, wherein the alkaloid comprises the structure: 




62. The ligand of claim 57, wherein the alkaloid comprises the structure: 




63. A method for detecting the presence of a target nucleic acid molecule as shown in 
any one of SEQ ID NOS:1-113 in a sample, comprising contacting the sample 
with the complementary nucleic acid molecule of claim 19 and detecting the 
binding of the target nucleic acid molecule with the complementary nucleic acid 
molecule in the sample. 

64. The method of claim 63, wherein the detecting comprises: 

a) contacting the sample with the complementary nucleic acid molecule; and 

b) determining whether a complex comprising the target nucleic acid molecule 
and the complementary nucleic acid molecule is so formed. 

65. The method of claim 63, wherein the sample is a cell, a tissue, or a biological 
fluid. 

66. The method of claim 63, wherein the sample is blood, serum, a swab from nose, a 
swab from ear, or a swab from throat. 

67. A pharmaceutical composition comprising the nucleic acid molecule of claim 1, 
16, 17, or 18. 



68. A pharmaceutical composition comprising the polypeptide of claim 27 or 28. 

69. A pharmaceutical composition comprising the ligand of claim 32. 

70. A method for determining whether a genomic nucleotide sequence of interest is 
essential for viability of a bacterial cell, comprising 

a. integrating an exogenous nucleotide sequence into the genomic nucleotide 
sequence of interest, wherein the exogenous nucleotide sequence 
comprises a portion of an open reading frame of the genomic nucleotide 
sequence of interest, and 

b. determining whether the cell having the genomic nucleotide sequence of 
interest so integrated is viable. 

71. The method of claim 70, wherein the portion of the open reading frame comprises 
about 200 to 500 base pairs in length. 

72. The method of claim 70, wherein the exogenous nucleotide sequence further 
comprises a nucleotide sequence conferring a selectable phenotype to the cell 
having the genome so integrated. 

73. The method of claim 70, wherein determining comprises selecting the cell having 
the genome so integrated in the presence of a selection agent. 

74. The method of claim 73, wherein the selection agent is chloramphenicol. 


75. A nucleotide sequence of interest which is essential for viability of a bacterial cell 
isolated by the method of claim 70. 

76. A bacterial cell comprising an exogenous nucleotide sequence integrated into the 
genomic nucleotide sequence of interest, generated by the method of claim 70. 

77. A method for determining whether a genomic nucleotide sequence of interest 
resides within an operon, comprising 

a) integrating an exogenous nucleotide sequence into the genomic nucleotide 
sequence of interest; and 

b) determining whether the cell having the genomic nucleotide sequence of 
interest so integrated is viable, and wherein the exogenous nucleotide 
sequence lacks an expression regulatory sequence. 

78. The method of claim 77, wherein the exogenous nucleotide sequence further 
comprises a nucleotide sequence conferring a selectable phenotype to the cell 
having the genome so integrated. 

79. The method of claim 77, wherein determining comprises selecting the cell having 
the genome so integrated in the presence of a selection agent. 

80. The method of claim 79, wherein the selection agent is chloramphenicol. 

81. A method for inhibiting a function of a CEG polypeptide which is essential for 
viability of a bacterial cell, the method comprising contacting the CEG polypeptide 
with the ligand of claim 32 under suitable conditions thereby inhibiting the function 
of the CEG polypeptide. 

82. The method of claim 81, wherein the function of the CEG polypeptide is selected 
from a group consisting of a pantothenate kinase, a Holliday Junction branch 
migration protein, a single stranded DNA binding protein, a phosphoglucosamine 
mutase, an acetyltransferase, a uridylyltransferase, a malonyl CoenzymeA:ACP 
transacylase, a 3-oxoacyl-ACP synthase II, a 3-oxoacyl-ACP reductase, a 
phosphomethylpyrimidine (HMP-P) kinase, a GTP binding protein, an ATP 
binding protein, or a 4-aminoimidazole carboxylase. 

83. The method of claim 81, wherein the CEG polypeptide is selected from a group 
consisting of CFE 1-113. 

84. The method of claim 81, wherein the CEG polypeptide is 2CFE 34 shown in 
Figure 55. 

85. The method of claim 81, wherein the CEG polypeptide is 2CFE 43 shown in 
Figure 64. 

86. The method of claim 81, wherein the CEG polypeptide is 2CFE 34 shown in 
Figure 55 and the ligand is: 



87. The method of claim 81, wherein the CEG polypeptide is 2CFE 43 shown in 
Figure 64 and the ligand is: 







88. The method of claim 81, wherein the CEG polypeptide is 2CFE 43 shown in 
Figure 64 and the ligand is: 




89. A method for identifying a ligand in a sample which specifically binds a CEG 
polypeptide, the method comprising: 

a) contacting the CEG polypeptide with the sample under suitable conditions 
so that a complex having the CEG polypeptide and the ligand is formed; 

b) recovering the complex so formed; and 

c) separating the CEG polypeptide from the ligand in the complex and 
identifying the ligand so separated. 

90. The method of claim 89, wherein the sample is a tissue or biological fluid. 

91. The method of claim 89, wherein the ligand is an azabicyclodiene. 

92. The method of claim 91, wherein the azabicyclodiene comprises the structure: 



93. The method of claim 89, wherein the ligand is a diazalactone. 

94. The method of claim 93, wherein the diazalactone comprises the structure: 



95. The method of claim 89, wherein the ligand is an N-protected amino acid. 

96. The method of claim 95, wherein the N-protected amino acid comprises the 
structure: 




97. The method of claim 89, wherein the ligand is an alkaloid. 

98. The ligand of claim 97, wherein the alkaloid comprises the structure: 




99. The ligand of claim 97, wherein the alkaloid comprises the structure: 

100. The ligand of claim 97, wherein the alkaloid comprises the structure: 




101. The ligand of claim 97, wherein the alkaloid comprises the structure: 

102. The ligand of claim 97, wherein the alkaloid comprises the structure: 
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ill x 
Atf 3att)tatocagoaattcttgccggtggaact^ 

TT rTAGjLCK^AGCTOATCOACCTAt^ 

AA VATT 3TAGrrGOTGTTCATGQAGACTGGOTTTC3TCA 

TTi ITAA KUACGTATCATCATTAOVM^ 

GC lATGCTTATCGTCCGCTTACTCCAGAGOATATCGTO 

AC VCTT X}CATGATICAGGACAATATOCAACT 

GC jQTT iatactatcgttoaaagtacgaatggtcaatttattacag 
TC- lAGG \caaacacctcaaacattccgttgcaa^ 

GA 3AAC QAAATCTTOACAGATGCATGTAAAATCTTTO 

G{j COAA rACTCAAATCTGAAGATTACAACCGTAACAGATTTC 

GAH'AG 

2CR£z'' M homologue of SEQ. ID NO, 2" 
ArOGCTMCGTAATTATTGAAAA^ 

]tfATCCGTCK:TGGTCGTGC(^ 
:TCTtAACCAAATOGCTK^ 
LGWh^TtGAAAGACATCGA«^ 
JTGATnXCTTGGTTATCCCA 

JA 

aa^gXaaaagcaaaagaaatca^ 

Aq/tGAC! lATGeTCTTAAACACATra^ 
^Cfi;3 "flornologue- of SEQ. ID- NO. 3"/ 

at< gc^Aaatattttgggactgatggagtccgtggagaac^ 
tt* aAct AGGAcarrrrGGAdGCTATGTTcm 

ACC fT0A( ^CACGTaTTTGACKjGGAAATOCIXjGAATCGG 
CAC GTA, 1 ] ACAAACTTGGTGtCCTrGCAA(^CG^ 
CC< GTO1 CATOATTTCTGGrAGCGACAACK 
CTCGAAJ CTAGATCWTGAAAAAGAAGCUUA^ 
TOqTCCAAGTCCAGAAGGCTTAGG 
lAACTGGMCrCCTCTTO^ 
{CGTQAAATCTTTGCUa^ 




CT 
CA 



m 



AAC A$C> ACCTTAATdTTGGTTCAAC^CATCCAGAAGCCCTT^ 
GC! ATTG GTITOGCCTITGATCGAQACAGTO^ 

atc arroj c^GATiAT^ 

TOT 3 ACA ACTCTTATGTCTAA<X^tTGGTTT 
. ACTT 3CAC n^GTGACCGCTACGlTGTTGAAGAMT^ 
Olfa 3TCA CGirATCTTGATGGATTACAATACGACAOOT 
A A T CATC AAGGAAACTGGTAAGAGCTTATCAGAGTTG^^ 

AG! t AAT \TCXX5AGTGGAAAACGTCATGAAGGAAAAGGCCATGGAAGTGCGAGCt 
cgJa OAAC ATGGAAOAAGAAATGGCGGGGAACGGGCGTATCCITGTTX^ 
CW JCOT jTTATGGCAGAAGCGCCTACAACAGAAGAAGTAGACTACTATOT^ 
GTTi JGTG n , GAAATTG(kiAtroACrAA 

2CF^I"bomQiogue of SEQ. ID N<s>, 4" 
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k 4 XiAA \MA^TACTAAITGTAQATGATOAGAAACCMTCTCGGATAm 
L IQGT rATGAAGTTGTAACTGCTTTTAATGGTC^ 
TV TTA-1 TATTCTGGATTTGATGCITtXAGAAATTC 
Q( 'AbTt jTGCCCaTTCTTATGCTITCAGCCAAaGAT^ 
GOAGA ^CTATGTAACGAAACCCTTCrcCAATCGTGAGTO 
fr TCTt AACCrATGCCAGTAGATGGTCAGGAAGCAGATAGTAAACCIX^ 
A< ?AAA rTGTtCCAGAOTCCTACOTGGCTAAAAAATAXGGC^AAGAACrAG 
Qi iGCTT TTGT|fTC^TrrAGCACCGCATACAGGTCAAGn> 
GOGGT XTC^ATTTTGGTOATGT^ 
AC IATA< JGCOCkOCCGACCAGAGTATATC^ 



TCA 



o 



CAA 



2dfc5 "homologue of SEQ. ID NO P 5" 
Jfl GGA> GLAAATTCTCTGTATTGGTTGT 
CC CCCC AXnT0GCA(nTGAAAAAGGTI7G 

CC ACT^CAATQAAATCACAGATGTCCAGTTCACGGACGATG . 
0/ GAG] &ATGCTTTAGTGGTCAATOIX1\TTGA 
AC GmCXSTCrcGGdCAATOATGTCCTC^ 

DC TGOT MtGATTAGCCAGTGGCTCATGAAACGTXjCCCATGAAGAAM 
TA ACTT ^AGCAGAAAATAAaTATGCCATTA^^^ 
GC GATf TXHATCTCGIXXJ^ 

~"~ CQOC TGATtAGAATGTCXTCACTACTTCACGCTT^^ 

TOA<p MCGpATCnTATATTTAC^TACGCCGGGAAlT^ 
Oj 3QCA AAAACCTCAAGTATOTCAGXtCTAAAAAGGAAATCAAGOT 
GC AAAC CCTATTTlTAGGTGGTTTGGGACGCTTrGAC 
TT Tjrn rATAATGAACTCAAACTCCATCGTAGCAAGCTIGAAGGAGCT^ 
TG IHlAACTCTtCTGACA^ 

:C^T rAAAGATAAGACAGACCTAGTCATITCAGGCCTAGGCTG 

}TGTGGGCACCAGAAGGCGTCGCCGTCGTCACAGGAAAAGCA 



a:cat 



'jHomologue 6f SEQ. ID NO. 6" 
AT< HA^fCAGATGATAGTTTGACATTGCAC^ 
ACpAA<?pGATTCACAAT^ 

TTTGCAGGTTTGGAAAGAATTGTGAACTATCT 
K/ACmXHnTGGTTAT^^ 
Qt CGTj CrGGCCAAOAAGGGGATTIXKjTTTITGCTAATGAAC^ 
GC( JAAT< iTCAGrTIGGTCGAAAGGGCTGTrTTG 
A<?< JTOCf ATTGGTICGGTTATCOAAGATGAACCCTTC 
GA' GCq<K:CAl^GGGAACACXSCGCAGCTOTGATO 
G(J* AAG JTCTTTGACXTTCCTC^ 
TX^AGCi TrcAAGGGmCGCTGCGACCCACAAAMTIG 
.' . J< rTGTACCAGCroCCATTCAGGTGGCGCGTCAGCTC 

GA( GCTl AGATTTATGCTTCTAATaATnXKJ 
Ad^TTO TGIvTCGGGCGTCGGTACCAAOCTGATTAC^ 

GAT] GTTGCAATCGAAGATGAAACIX3GTCAGATC 
AA/ AGfC fTCGACGCGAGGTAAGAAGCAOGTGTGGCGC^TTACCA 
OTC ACTA cATCAtnTATGATGOTOT^ 
TA]C AtAC ATCAiAGAAGACGGTTGOTAATm 

taj fact rracaacttgcctagtttgactgacm!ttcaggat^ 
ga|i — 1 , - . . - 

2 era 

ATP 



TGGV CTTC^TIGATMGATGCGCAAGOAAGCCCtTGGTC 



fi • ■ . ... 

I'homologuia of SEQ,' ID NO. 7" 
3CTA n , ATTCMTGGTTTC<nXK}TC^ 
TTGTTGA' TTTGTGACGATTTtAGTAGATGCACGCTTGCCTCTATGT 
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V 

. V 



ATtronjaGTOATAAACCAAAACTCTTOATTrt 

\A V (XX QTCAdTATTrrGMTCACMOGAATCCAGACCCTAGCTAT^ : 
AA AACT TOTA ACAGATGOiGCCAAGAAGC^TGGC^^ 
TCkQM rGAAACCTTGGK/TACCATGATTATCGGGATTCCAAACGCTGGT 
TT 3GCT KrTAAAMGATTGCTGTiXiTTGGAAACAA 
AC CAAl AAAQATCTGGAAATCnTGGATACAC(XjGGGATTCnrr 
CA CTT A \GTTOCM^TTGACTOQACCTATCAAGGATCAGTTGCTrcCT 
AT CAAl rATTjtaAAGAACATTATCCAGAAAAGCTO 
OA AGCC CCTOTGATIATTATAOATATGACttXKXJCCCTCGGTTTC^^ 
TGAAGGAAGTCXMTOATGGaVAAC^ 
UT&A ' 

"homologue of SEQ. ID NO. 8" 



TCrCfTTCG 
GGCAACGA' 



AT MTTUCAATOnOTA^ 
TA3CAGhrGC#ACTTm 
TT TATX AATCmXnTATGTGGCGCX^AA^ 
Af :GG<? [HGACAGOTCGTATCCAGACTCGTAGTTACGAT 
Gft 3GTC STCGCTOAGAATIT^^ 
TX TTCT< ICACfl^CTGCAAACTATTCAGCACCT^ 
CkTTTQ t <\AGCAACAAAC<mTrM 

2.Cf& ,, homol.bgu^ oft SEQ. ID NO- 9" 
ArOA^CGCbTArTACAGAATTAnGAAGATTGACT^ 
CT( 5ATC* jTOAfrrcGCAGGGGCTOTTr^ 
QJl VA04 ^GCTGTCAAGGttiAATATf OAT 
AT{ rCTC TATCjCCCTTTGl^AAGATATCGTC 
3Aa$ IGGAAATCXIAACCAAGIATATGG^ 
CC "AfpiJC rCTOOTTTAGCtAAACGCAlXJGAAAAAATC 
QC % GGG JGGCaTATCGGTAAATTAACAACCATGACCTC 
CtC rTTAI TGCTCKUGOAGGAATTGCCK^ 
rd' ACAeC fGlt}QGCUCACGGTTTOTAGTKiCA 
TT1 AAA> tKiAAGGaArATrGACACTAOOAT^ 
ATX AGt] GACTAGAOATTrroAACTOGCKUAAAAGATGCCm^ 
GTl TGA> CAAATOGOAGCAGOfGCCCTAGCCAAAGCAGtTX^ 
ATC (GCA< KiTCAAATCXK^GGCTTGTTlfcTAAAGA^ 
A<#GaC* XXXJTAaGAAAATTCAAG 

2.Cflho "homologue of SEQ. ID NO. 10" 
ATC ATC^TATTCMGGAATCAA^ 
GG> AGT^ IAGCGAGGATaCC^TTGTTGCTATCA^ 
TTT OT<p UTACCCAGmTCCGAGGTGITCTC^ 
GTI AXCfi AAACCTGAAAAtAAAGGAAAACTGGTCTm 

GC/ AGT3 GTACCAGGCGACCMTTGGTTATGACAGCGACTTTTpTA * 
GTI QAa< CAAAGGCTGAAGTCKJATOGC^ 



;aa 



to 



i 



2CRtilIj>( ,, fromol6gue of,5E0. ID JIO. TJ" • 
ATC ATTA^TCAMTlf ATCAACfTAACTAAG^ 
ACC AAQJ GAAtl^TATO?rrATCCGTaXAACTACATO^ 
GGC AAAV .CGTaATCaUAGATTnXUATAAAAAGCTTCCA^ 



ATG *ZAGa gtgatgmgaattctatgaaaactacatgacagggaotca^ 

GGT [TAT }AGA 5AGTTTOmCTCTCa:TAAAaATCGTGTOGTGGOT 
GCA apW ITAC VGAGTTTGTCAOTKnGGaCATGCAGGCU^^ 
AGC3GG/GCGC ATCGCCGTTATI^AGATGQAAGTTTAGCTTTraTGQT^^ 
TTT< KJC^pAAGvXGAQATroTGGTTATrQGTCGTCATTGGGAAAAOTTOb 
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a J i. tp rlr a t a WArr^ATAATATTTCTOAAOATTO 
^ TA AW CAOCXnTTAAAACAQTQTTTAAQTGGGAACTATAA 



2 CTJll 
ATpAAdTTi 



mh a a rtTArGcTCC^GGGA^ 



ID NO. 12" 



GO. 
AC* 



£ Kt£ -S^atatoSa^aotc*^ 
•#!ATAraSac^^ 



^ IGPAHCTTArACTCCCATC 

.tgJactvla- . ■-• . 

ij S3 A ?OAAWC^G6GGACAATOaA^ 
A& ACTY CATC«AAaAC^^ 



GAAflCTTCAGC 

cgaagaAmgotcggtoaagcksgtk^ttataa 
ctc Acri]rrcTtAA 

A" 



TIG AATflTAAAAOAAAATAGADAACTTfll 

AG/ QTQC TTCf** " " * 

GCit^OGlGTT< 
COA0ATCfGA( 



^ JTTTTOGAOAAGrraCAGAGQCTAGTCTGAGT 

:<tol^CTGTCATT^^ 

TCATATCdGTGAAAATCGfGTAGATAAGTTl^ 
KSCAtrrGATTGGTACXrrTGCAAAGA^ 
TTCGACTCAGTAAAGCTAGCAGiMGAAATTCA AAAAA a^ 
GTAAATATfTCTAAAGAAGAAACCAAAC&^ 

GTTAGGCAGACTAGATAaGA^ 
KJAGTTGAAAGAGATTTTCAAGGCGGCCCAAG 
_ ATGCCTAfriACCGAGTT^ 



VTT3 



GCJACXJAbtG 
CAMTTCSAAA] 
>?r<iw« "homologue of SEQ. ID NO. 15"

■ vSyvTTGCTCTAGAAAA^ 

(F Tr^S A G^AGA^aC^ 

S^:£^tSatctaaTO^ 

or*%vt*> "homologue of SEQ. ID NO. 16"

•■ .SffiaAT^Tm* 

S^^gSS^gStiT^^ 
^ SaGCToSSc^ 

tw. GCPttMAAACTCAAGGCTGCtGGTGTTGCGTTCGG 

^ toXSo^oacggaactgam 

^ V " TAGGAGACaCTCGAAAAACQGOTGCCAOTGTGCGTCTQGCTGTCAATCCAGATGTCCTAGTT 
'CACTTATGGCAAGGGTGATGAAAAAATTGCTCGTAACCAKKjTCACTCA^ 

ctgcl^otagcggtaoaagcaggtgccaM 
Aa JSa^tnwcAA^unta^^ 
Igtggaaatctag 



OA* 



Aha* 

TGA 
c4> ATC 
^ Af> 
fci A 



of SEQ. ID NO. 17" 




GAAGAG1CG 
AOArGAcmc, 



£CFiSW \ "homologue

TO !GAG ^GTC^TTiTroWr^AATGTA^ 
GG" IGAT HCCAAACATGGATTACCTTTTTGAAAATAGCGAOtCT 
AT< SCOG tfGCCATTOGTGCTCTACCTTA^ 
AC< lATft fAGTtGGCAAAGCTCTrTGTC^AAGGAAATOATGCCGTT 
TTC ATOv XlAATACflGAGATTGATTnGGTGGGACAGTGGTTTCCT^ 
GAI jaOT nGGdAATTGTCTTGAAGACATCGGAAGGAAGCATCGTrTATACA 
AA> iCGG nVUnOAATCXTATGCAACIOATTTTGCnSJTTTQGCAQA 
T<h CCTC ^OIGATICGGCCAATGCAGACAGCAATATI^GGTGGCTAGTOAAAGTGAA 
AGC< AAACTATTGCTOACTQGGAAGGTCGTATCATCGTrGCAGCTGmCCAGTAA 
GCAX ATnTIGACGCTCCGGATAXAA^GQTCOACGTA^ 
3<STCC OCAOAGGGATTOGTCTTAAGAAGriOIGTrrAGCCAACGAAATrC^ . 
TOTCTCGCTnGAAGACCATaAOTTGAtrATT^^ 
TGG1 AAGATGTCGATIGGTCGCCA7CGTrATGTAGAAi\TCA 

accIggtc cGTCTAtixKrrAMaAAoantroxTGcqeGTQTOG 

TTC fOAA ATTGATTACCCAAAGTn*ACATGTAl€AGGG 

fcCCTAAGTACCTCrKXXriX}TC£AAGG<3GA 
JCfllGGClAanCKXaATC^ 

,TO< AGA(?ITrGTTCCAGCTGOATCGGTTTCAGGAGGAGATAT^ 
TT 3GAMTGTTOTTCTTCGTGACXX1TAAGGTCTTG 



VTATTCTCCGiGAAAGTXCAGAATTGATTAACCAAACGGTAGAAGAGTATO 
icrGGGCAGATCTCAAAGGTAAGGTTCGTGACAATC^ 
AC£j\AOdGTCGaCCAGCCATO 
(Cm9) "homologue of SEQ. ID NO. 18"

;TTTC/VTCATCTAACGGT(^ 

GCGACnTOGGCGQAGCAGGACATAGCGAG^^ 

OCTTTGACCAGGATCAGAATGGCATTGACMTGCGCAAAM 
JWJTGACCrn'ATCAAGGATA^CTTCCGTCATTTACAGGCACGTT^ 

TTri^ATAAAMGGATGCGCCACTGGACA^ 



AA GGT/ TCTACGTTGATX5CGAC 
AA AAG< CCATCTCTATGC 
TA CAT* 3 AGAAGGOAATGGTGAC 



ff) TAP 

* . ct 



<JC3^GCT 



tTGAA STGG^AATCATTATDACTATCATGATTTG QI^GTATTTTCTrCAAATACGGTG AGGATAAATT 
^\GAGATTOCGCGTAAaATTOAG(^GCGC^^ 
AG CAG/ GATTATCAA<3TTGGTCAAA(XrrGCCAAGGAACTCAAGAAGAAGG^ 
TT «GA JGCTArKXJAATrGAAGTCAAltJATO 
S£iKn«ICICroGATCKItA^ 

AT rGTP lAAGQAAGCTTCAACAGTTOAAGTTCCAAAAQGCTrGCGTTTC^ 

AA 3AT€ OMXTGGTGTCGCOTAAGCCMTCTtGCCAAGTGCGGAAGAGm . 
IAAGTTGCGCGTGGTCAOAAaAATTCACAAGCTCGAGCACCACCAC^ 



cattcAscc 



<2'CFai "homologue of SEQ. ID NO. 19"



TC'XSGGA' 




GG< 3AA' 



Ar^OtKGAATmAGATAATGAGATAATGGGGGATGA 
TA' TTAk JGTGAATATATCGGACAGGATAAGGTGAAGGA^ 



TGAaGCGCT<KjA1X^K^GCTCTTAT^ 
TCtTATlGCa^CGAACTGG 
GT< K3TA< iCTATTrrOAATGAGTTAGAGCCTO 
Gfl 1AQ1< KSAAfeAQQTGCflTTATAGTGCTA TO 
QG :AGT 'GTAGfGTTCATTTGGAGTTACCAC^ 
CTXXZAA' OCGQTAGGGGCAGGTTrrGGGATTAC^ 

TP JTCG^GCGGACGGCAGATATTmGAGATGGA^ 

;< iTOTTGtiGACCCCTCGTATTGCCAATC 
GGGjTAATI^TGATATTATTACCGATAAGGCTTTGACT 



aL lGOT TTATCATGCGGACACGGTCTGGACGGGTG 
AAtACA<pTGAAAAAGC<MCCGCACTCGA(^CC^ 

2CEtil4 "homologue of SEQ. ID NO. 20"
ATC AGf ATtiTTmAGATAC^GCTAAG 
TTC OTCC TGAASAAATATGTCCCrA^ 

en cgttstagIacgmggactacgtaccttgatggatt^ 

G?C AAAi AGGGATGACCAAAGGGATGCATGGtCOTGGTGCT^ 

OQAC TGTT£GTCATSCGGAGACT^ 
CGI TQCC CACGiGTGGTCXjTGGTGGACGTGGAAATATI^ 
ATC AAAATGGAGAACCAGGTCAGGA^ 
U OCTl TTAA TAGGATTCCCATCTGTAGCXUAGTCAA^ 
^ TGC TOCC rAC^CTTTACCACtAlTOTAGCAAATITA 
to. CAC TAGC CGACTTGCCAGGT7TGATTGAAGGGGCT 
£ TCA 2ATC lAGCGTACACGTGTTATCCTTCACAT^ 
AGCAtbccrAGGTAiX^TAAAOAGCTC 
TGT VACT \AtAAGATGGA6tttKXH^ 
AAA TTAT 3ATGAATTTGAAGAOTTACCAGCTATCntX 
ACA CTTT rAGATGCTAGAGCTGAATTCTTAGACAAGACACCAGAA 
TGG \AG/ AGAAGTTTACTATGOATn^ACGAAGAAGAAA^ 
CGA 2ATG QGTA CTITC1WTG AAAAACTGAT^ 
GTC ttGAAATT tecCCGTtAGCTTCGTGGTATGGGGGTTGA 
ATG^GAjTTO 
tga 



2Cjj£25i "homologue of SEQ. ID NO. 21"
ATsXvOTACTirfAATGTTCGGAAAATCGTTAATAC^ 
TO WXJC ATTTTGCAGAAGAACGGTTTAAAAAAGGAGCTGAGCrc 
G rGCA ^ACApTCACCATCGCTAGCCACCGTAAACAGAAGAACTTTGACA 

A :ca+ vtcaJstactatcgaaaagtacaagggataca^ 

CT ^GAC CUTG*5FroAATTTTACT^ 
OA LOCA TCAAjOGAAATCXnGCAACCAGGTGCTAATC 
AT ribc rmACCTTATATCCCACCAGTGGTTCTCAATGTTO 
AWTrA'lMO|Km 

"homologue of SEQ. ID NO. 22"
Al^dATTOATAITO 

TO 3AGA AAAaGGGCTGTTGGATATCCAGTATCATAATTTTCGAG 
Cfi TGAT 3AGC^XTAGGGAGGCGGTCAGGG^TGTTGCTCAGAGlrACAACCT 
CTj \TtGi iAAAQAAAAATCCGCGCGTTATTCTCXnXX}ATCCT 
T& LA)3A rTTCHpCTCAAGAGGAAGAGCTAATCTTTATCTGTG 
AA 3AC$ ntKxXAACAGATGAGATITCCCTAGGCGACrATO 
C& UGA nrGAWJCTA(^GrrCKKrTGATtCCAGA 
TT TTC7 TCAGGTCTTTTAGAATATCCTCAGTACA^ 
An iTAt GATGtAGTGGTtUCCATGAMAGAra 
C& IGCG ^AGGCCGGAmACTTGAACATTATCAACTGaCAGTAGM 
C^^aA^CAAAGAAGOWCCiX^CT . 



2,Ct , ih'/ "homologue of SEQ. ID NO. 23"

at< iati&aagcjjaagtaaattaaaagctg^ 



bo 



5' 



AQ<XC 
AT<t< 



>TC AAA 1 



ctc^ttmtacatttgacacaagctaccgtccagaggaaaaatttg 
tacttotacaaaatggatgacac^gc^^ 
tagtpaatgttgaaAacgaattgcttx^catccttgaaaact 

k GTGATCG<JTGTCACCGTTOCTACtACTC 
&GTGuTACTGTTACAGGTTCTGGTA^ 
CACfAtr t^TCGAAGCAGGACAAAAACTCGTTATCAAC^CTGCAG 

cgaG<^<:gAC(^cx^oc^cx^ctga : 



^nh8l "homologue of SEQ. ID NO. 24"
• ATC GCAftTTGi^OITrAACAO/U 

CTC AATC TGATGTCCAAGfAGGCAACXAAAGAAATTCGCTTG 
TGI TQTA AAG^tnTTATCAAGAAAGTTAGTGAGCGTGCAGTCGGGCATC 

GCOC AACAOATTATTAAAAIXXITTGATC^ 
TTA IX}A^ GTC^^MGATriX^^ 



ATT1ATC?AAA' 



TGC TGGT AMTTGGCCAACAAACTCAAGAAAGAAGAAAATGCTCGTCC^ 

I CGTC CAGCtXXCATTGAOdAGCTTAAGACq^ 
AAC|A0A> lGT AC^GCTCrrTGAGA 

A rroAfrACTGCGGGTCGTnX3CAGA 
C rCAACCAAATXlAAATCfTGCTTCTC 
GCdTGAC TTTaATGCTCAGTTCGAAGTGAC^^ 
GGT GCTCCTCTOTCTC^ 

CGG ACA1 TGaAACCTTCCACCGAGACCGCATGTCT 

GAUGACAAAGjHTCTCAGG^ 

CAjC Ctlt SAttt rAATGATTTCATTOATCAATXAGATC 



GC r :GTA VAOJTCKXATTGTCT^^ 

GCC XGTGGTATtG<nGCTGGTTCTGGAAATACATTCGTC^ 
AACjCAGqCTAAAGAGCTCATGCAGOGTOTTAI^ 

iJTAACCTTCCTAAAAATATGCCAAA 
XdJATG SGaCAAGGCGGTATGCCTCACTTATCAGCI^
T( iTrr^GTGC^GGTrrGAAAGGTAAMTTGGTGAATTTGCCATGMACA 
G ^AA| rGMfJAAAGCGA AGAAGAAACGCAAGGCGGCCGCACTCGAGC^ 

T539 "homologue of SEQ. ID NO. 25"
A XSTAtCTTATTGAAATTrTAAAATCTATe 
Tj 'CCa!< fTAdAGGTCACTTGATTTTAGCAGAGGAG^ 
T< fmAATGTCGTGATTt^GOT 

C( rrm AMGfcGACTMGGACAAACAGGAAGTTCGTAAGAC^ 

c( ACTtm^cnTAcm 



T, GCTC TCATGTTGATTATCTAGGGGGTTGCC1TCATCT 
& LGCCl UVGTGTMGAGAGTTGGACAAGCrrcCTTATACGACCGC 
ti CT^rrTAbCAGGGACTAGCeGTTC^ 

g arrlc tgagagaatttaccttctatcttgggattcccgttatc 
kj ATItl gtgaaagccggagaactcttgagctttgggcaattottt^ 

CI nffG MGlCAGCATCWTGOCTATTOSCnTC 

<X fTAAj/ lTaCGGTATCGTGCTIXKjTAGTGTTTTCCTACTTT^ 

c/ ccac cacgacgactga 



GAATA< 
AGAAG, 
OAATO' 
TCAC* 



2C£E30> "homologue of SEQ. ID NO. 26"
ATp0GHTTATTTOACCGTCTATTCGGAAAAAAAG 

TCTTGATTTGTCTGA^^ 
AAjGCAdAGGTtGAAATTGTTGAACAAGCTGTC 

GT|CTC<9aTTTa^^ . 
GtTTCTAGAGACTATAGAAGAAAATAATTtrrGAAGTIXnTC 

cggttcaggaaaaatatqaccgcagtgttaagaaaactcgtac^ 

tCfAAGTTOCCkrrCTCTTGACGAAGAATTO 
/reTTOGTOTCCAAG 
AAACCTGAtGCACTTCGTCGTCT 
ATGAAAGCATCCAC^^ 

:AACrrcrATCGGAAAACTAGCCCACCGCTACAAAC^ 
rc- CAfeCAfflj/VTAGtjn^GTGCGGGTGCAGTAG 
AGrA^CrGGACCTGAAAAAGCTGATC^ 
GG :ATCi 3ATATTCTCATGATTmTACT 
AA VAGA rTCkStCGTATTATGAAACGTGT^ 

Tp jvc^i jgk^aaatgccctagtacaggccaaAgaat 

TO iCTAS, lGATTGATGGAACTGCITOAGGaGGTOTC 
A^ LATlp JATTGGTTTTGGTGAAAAAATC^ 
CK TTGC AAGCrrTTAATCGCGGCCGCACIW 

1 - "homologue of SEQ. ID NO. 27"
TATAntJAAATGGTAGATGAAACT^ 
TG^AATT TGCAGCCCAAAAAmGGAAAAGAAGACAAGGAGATGGC^ 
QJCJ TGAAOTAATCTGGAGTACCGTOACACC^ 




J{ GT/1 



2 CHEW, 



AO AQAi JTGGAAATTGCCTTTCU(X)AAGAGGATITGCn^^ 
OA( fTTlt ATGOCTATATTQCKWAAtroTW^TCTCTA 
Git AGAC CTTT BAGCGTGAGAIXKjGGTTCTTOGCAGTACAC&GCTTTT^ 

TA7 ACTC OOQA i LGAAGAAGCGGAGATGTTCGGTTTACAAGAAGAAATTTTaACAGCCTATGOACTCACA 
A» CAA< TCG/j 3<^CCAMAC«ACCACGACTGA 



"homologue of SEQ. ID NO. 28"



CTp WOC WACpXK3ACmXX»TGGCAGTTnX}GATCCrrr^ . 
GTT "TC^J IGQATAAGGCTOATTTAGCTQGTnTCAAGCGGATGTCro 
TGCpTACj' SAAAATACACGTTXTGCrcnxSAAM 

Vr 



6CAGC/ 



GltTTTGl 
l£ 0/|CAA 



aIqM^aaaTiocaqagctaaaaqaattttctcgtgcccaagac^ 

IPC CCTI GGCTXlCIOTCiTACrCATQCAATTTGCQ AOGCAQQCTGOCAAA 

tl QAC5< tccatcatgacaagaAaaagoatgctccgagtggaac^ 

WaAGliTrCGAOAGTXX^TTOW 

CI GACI TTQATGGTATGCXKSATCCACTCAGTltXJTTIXKX^ 
S GAA ! CA^GAGAAGGGTTGACCCTCCGTCATC^CTCCnATG^^ 

GAA^AMGAAGTreTCAAGCGTCATGAG 
QCAOdCCACj^CCACTGA 

2.CIB33I "homologue of SEQ. ID NO. 29"

ai ^cWaaCAAACaAGATTTGATCGCTAAAGTAOCAGAA SA 
OTTOAAGCroTATTTGCAG^OTAGCTOACTATCTTG^ « 
3TAACTTTGAAGTlCGTGAGCaTaCAGAACGTAAAGGTCG<^ACC 
ITOCAGCTTCTA^GTACCAGCATTCAAAGCTGGT 
CjACCACCACOACWCCACrOA • 

Kt 2AQ1 \TCCGATTGTTAAAGAAACGATTGATCGAGCX)AGTXlAGGTGCTAGGTTATGArrTO T 

CArC^rACGPAAGAAGACAAACTCAAtCAG^ 

GC IATC rAaxmTATTOCMGA AAAGQGGTATCAGC^ 

CT rTGC riTGGTGGCAAGCOGCGCClTGGATTTTGAA^ C 
1A' ?ATG }AAG^Ga5CKriCCTGCTGACTCTGGCAAGATGGTAGC^ 3A 
V 3 TtiJAAGUVGci^K^AMGGTTCTGAACTT 
• M rCGT JATTXSCTGGAGaAGTKKJTTGCaGTTGATTXjAGCG^ LA 
iX CGi JDTGi JTCCTClTAAGXnOTCAGGTOCCTXTC S 

Vt tg. vA^arcnoGCTCAGGTAAGnrrrro^ !A 

K»< ^Ai^AGA^ACATTGCTCAGCTtnTOACXJCGICA T 
TG( SGGT IATOGAAGAAGCAGQCATAAGCAACTTT^ [T 
OT* -AAA lAAArrcATCAAACTGCTCACTTAGCTCATQTCGAAGATC^ 
A£. IAAC IXXiApCACCACCACCACCACCACTGA 

2CM35 "homologue of SEQ. ID NO. 31"
AT< ^AACTAuAA'CATAAAAATATCTrrATT^ 
A&'TTGCrifcAAGCAGGAGCqAA^T^^ 
QT1 TTC^ AACtATGGTATxiiAGGTCGrKXX^ 
ATK fATTC fATCAACknrATlxkAGAACTOGGTTCAGTA^ 
ATAC^TATOCTCAAOATCAtAGAAGCAG 
^ T&TATt ACAttAATC^GTXHTOA^ 
AS, Q^OTit GtrrpATQBGGAATiATr^ 

Hi di/ GTO GTGG^CGCGAaGTOGCTAGTpa^TAtAraAG^* * 
1 GATA TOACAGCTATCTf ATCAGATAAC^TliA^ 
GGC/ GGCAGAqCAGsGTniS^ 
^fcC AT^TGOT^G^A^A^ 

ID NO. 32"




OCWtW i "homologue of SEQ.
ATC GGAdTOAAAMGAAACTAAAGTTXlACTAGTTTGCTAGGACTGTCT 

coa ctaatgoggtaactagcgatattacAgccgaAtc^ r 

GCGOAAATCATn^CTnTrATCGTTTGATATT 
S COTiCACTCCTCTTGCCAOTCnT^ 

GOA rTAA ggcgcttcgagaacaatatccaggtcgaqatato % 

TOP }TAA AGTAjlTrAAAGAAATQGGTGTCAGACAGTCAQAC V L' IC 1 1 IG GCCaATTTrOATPGAOATGCC 

V^GGTrATT [TGG<pCCTGTTCCAAGCCCTATCIAAGAO TTO 

^ ACC fTOG rAGTGTGGAtACAACCCTTKjTTCTTCCGATTT^^ 

TTO CCA, kCA^GCTTrGTCTGAGCGAAATGGCGCTACGACTGCGATGATGTATQOGATTCCAGTC 
TTT TAtf mGCAGTfTATOCGCCAGOTGGAGTCGCCCTAtACTGGAGAGTGTCTA^ 
C\CCA2CA<



r'OATGTOi 
AMCGtAGGl 



A' 

r< 



n 

CCA 
AC O 



XlTCKUjUCCTATTTCnTGAATAATCC 
\OAltnX30AAAATAGAAAAAGAAAAGCGA^ 
.CTGA 

adflew "homologue of SEQ. ID NO. 33"
ATCLMkATrAGTAAOAGGCAOT^ 

tCIATTCXiACCACCAGTGCTAT 
LATCTTTTGGATItjGTAGTTTGATACTOATTGrc 
riAQAUTGAGCGACTAAtCATTTTAGmTATTAATAGAAATGC 
G< JTAT TCAGTAAACGGGGCATACGGTTOOATTTCGGTTG^ 
rAAAAATCACTATTATTTCKS^^ 

GTTTITGACTGaAAATCAATC 
[AAAOTTrGGGAATTrTCCCTGATTTAGGAA^ . 
XCA< riTAGTGGAATCGCTTATCGCTGGTTTT^ 
TCjrrG/ CCACTATCAGCCTAATCGOTff^ 
_ Cm AGTGCCTITmAATCCTITra 
^ <K CATC !GTCAATGGTGGrrcGTTTGGTCTAGGTCTTGGAAACT(X5^ 
U .GCTC A TACAO ACTTTGTCrrrrcrAT^^ 
CTCT rGTTTTTCATGATTTTGCGGAT^ 
OCAC TOTQfcTCCKlAGGaATGATGTT^ 
ATC? ACAGfiAGTAACCTTCCCCTTCOTATCCC^ 
CTl rGTXTTTAAATATrGATGCCAGTGAAAAACGCGCTAAGT^ 
ATjOMC CTTC|pTTOAACK:TCGAGCA 

2CB]B38f 
. AT sen 
crrrp< 



"homologue of SEQ. ID NO. 34"
ATlQCTCbGAAlTITAACCITrATTCT 

C AAGAAATCAGGGATnTAGTACGTGAATTTGGCATCGGTATG 
TTtGCA ^GGAiXJGAACXijGCCTATACCATTC 

GGjOTOA TGATACAaCTGLAAaTCAAGACAGGAACGCCTGTTAGTTO 
K ATCAATCTCTCAGGrAAAAAATrGGATCAAACAG 
.6 ACAAGCICTTTATCAAAGQATTGOTTCTCK}AAGAAG 
AAfcGGT IXJTOCjLAAGCAGATGGTACTGAGGTTCGGATTGC^CCTT^ 
Aft 7TGG SGCAAAGTGATTACGAATTTTGCAGOTCCTATOAACAATT^ 

go rrnv atc^atgcagggtcgtgtcagagatgttga 

CC TQGi ICAAeknAGGAGTACCAGAAACGGCAG^^ 
^Ga jAAA (jCrroATCCAAGCTOTGOAAACAaA^ 

TTj CtG> AAAGGGG&GTOACAAACAAGTCACTGTTACAC 
TO TC^V £CQ(]X5GGTTAAGTCA^ 
CTC TOGC AATTXntTrcAGGTCTGAAAAATGTGAT^ 
Q<7 ATCT TTAAGGCAAGTAGT^TGCTGCTAAAAATGGAATro 
1TI CCA7 CAAfATTGGGATTTTTAAtXnTATTrc 
Alt ICTA( MGGCATCCXKXXKlAAACCATTGAAACAAGAAATO 
GTC AfC> TGGTTGTCrTOATGATraCTCTOA 
CO CCAC CACCACCACTGa 

£CK39 "homologue of SEQ. ID NO. 35"
ATC T^cd^t^mAAAAOOAAT^TTAOCAAAA 
TTG STTA rATCqTQCATGltKXX^T<XTTATGCCTATTCAGGTCAG^ 
GtC CATC AaQTrraTOOiTOAa<?AakXCATTTGCT^ 
^ TTCrTAGICTAATTTaiOTCnX^ ■: 
M» CTOGCTrMTTOAAQCCATTOAAACCSAAOAAC^ 
_^ AACAOaC^GaXOATPGTOCTCWACT^ 
V|: GGT 2QCA GTGGAACKIAAGTGCTCIAAAACCAAGAATTGGA^ 

PT*A 7A A ftPA A'OA/IA fZ'fTf^A A A A' a 1 AAA TTWrarT/* a * r-» r-»-^, r. . - _ 



TAT :AAG rCGG|XClTAAAATGTTT]GTCAAACTt^^ 
JB40 "homologue of SEQ. ID NO. 36"

A: OA44?AATAATCOTATmAGCACTnCT^ 

GC CTA< ICTATACCTTGAAOlGCTTGCATGGGrnnGTAGCAGTOACTTGTTTGA 

0( iATTl GAAG^TCmCCAACTGATGATACCATTTITCAACATO 

TCGGCHJAATTAAOATTGGTCT1CTCCCTACTCTCAOTGTGGCT 

C( CCD ,GGAd|TACCTGTGGTGTTOttATCCT^ 

AC C7CT GCCA.AGAGtTCATTCGCTTTTTCCCrrT 

TT ^TTArcCX^TCAGGAAArrAAAACCTTGGAAOACATGAAAACT 

G/ ,GCQ< !CAG<&GTCATTATCA AGGGAGGCAATCGTCTTAGTC 

TC GAQ GAC»TTTACTATCCTAGAAAATCCAGTTATCCAAGGCCAAAATGCrc 

GC CTT^AGCATTGCCAGTCACTTGATTAAAGGTGATAAACTm 

TC 3TTt VTCGTOCTATTGCACAAGCAGATCAGTATG GAGTMGAC^TATGAAGCAAACAAAAACAACC 
TC SAG£ ACCAtCACCACCAGCACTGA 



£0*1 Hi 



n AAA' 
GO 



2.dfS41i "homologue of SEQ. ID NO. 37"
XTpATTtoAAAteGGAGAAAAAAGAGGAG 

qAO jTCTCCATGGAAGAATTGGCTAGm^ 
GAlc^AACQlnGAAAAArATGATTCCAAGACCTTC^ 
0G rtfG/TGCAiQAAGAAATC^CTACTGTCATCGTCAACA^ 
Qfi GGA> GTTCtCGGTOTTAACKSTCATTGACCGTATC 

CO TGAAGGGAAGCTCC^GTCCACCTAGCCCAACTCAAATACCT 
(ATI ^TGCpTCAGCCGTCAGGCAGGGGGAATrcOTTCCCGTGGTC 
AaCCGT 2GTAQCGTTCX3CAATCAAATC 
3ACT STCAOAGAAAAACGTTTGGAGTC^ 
MAI CAACTATCATOAACATCTTOACC^ 
GG \TG£ jACAACCAAGAGTATTCATCTGGGAGGGAACCTCCAAGTAAC^ 
ATpCAA JATTIjGCCGACAOAGTTCKnX^ 

G' TCATOTTATCGATGCTAGCAATCCTrACCACGAGGAGC^ 
AAj\GAC CTGGACAlXK/AAaAtATrcCTCACTTGACGCTTTAT 

a^ccaAacgccatataccctgatttc^^ 

XJATAAGATTAAGfcAAATTTITO 
AtAATirAGAGAGTCTTXjCAATTCr^ 
GCfACA' TTCGGAGAAAAATAAATGGAGGTTAGAAGAATTTTATGACCTCGLA.G 

CTCrA t ' 



X % C AT< u 



"homologue of SEQ. ID NO. 38"

at^gcaCaaaaaacatatcctatgacccttgaggaaaaggagaaacttc 
ttg< ntggtcgaccagaagtggtag^cggattaagatttgcccgttc 

GTGj LOTAGOAAGCAGCTAAGGATGAACMGCCTTIX3TCGAAGGACAAATC 
AAj ,TCO iCTA^OCTGAAATCGTCAATAGCGACGCAGTTGCCCAGGACGAAGTA 

CCA1 CCAAGAMTTGGTGAGGACXJAAGAAGAAGTTTATATTAT^ 
CTT IQQA GGTAAGGTTJCAAATGAAAGCCCAATTGGGCAGGCCTTX^ 
AG(fAAC< lATTOAAACGCCTGTTGGTAGCTATGATGT^ 
CCAJC 5CACCACCACCACTGA 



7X3?rf43' "homologue of SEQ. ID NO. 39"
ATG ACCMAATI{ACTroTAGGCTrcTO 
TTA TG*TT }ATT<jiATCAACTAGCaAAGAAACAGAATGTC^^ 
OCT \GCA IWrTTTTCCTAAAT<k>AGAAA 
GGA AAA< rCAGTTCATGCtTTATTAACtTACTATC 
TCT PGAC il^^GTTQGGAAAATTCGTTT AAiSA 
GTfc TATt ^TTCAACATATAGGA AC^GGTCm 
GOT *TGT ^AGTTGTTCATCATGTTTTGAGfAAGTTTGACAGG^ 
TGA 2AAA GTTG ^CGAITCTOTAAACTAGTATTTACAAGaGAAAAAATTTC 
TAApGGjjV CTCG ^GCAOCACGACCACCACCAGTGA 
fccjiBM "homologue of SEQ. ID NO. 40"
A/X3ATrftTAAiTACAGGOOCAMT^ 

Ai iGAA rACGifQGCAOTAQATQTGGCTGAGATGOACATTACCQATGCAQAAATQQTTCAGAAA.GTTTTI<3 
ki iGXG }TGAAACCGACTrTAGTCTA(XACTGTGCAGCCTACACCGCTGTTGATGCAGCAGAGGATGAAG 
Gj .AAA jAGT^GACTTCGCGATCAATGTGACGGGGACAAAAAATGTCGCAAAAGCATCTQAAAAGCATG 
G1 'GCAIi ^CTCTfkGTTTATATTTCTACXjGACT 

A< fTTG 1 / tgaccgaccagatccacagacagaatatggacgcactaagcgtatgggggaagagttagttga 
q; agg ltgtgtctaatttctatattatccgtact^ 

ti acc/ tgcamatcntgcgaaaactcataagactitaacagttgtaaatgacgagtacggtcgtccgac 

71 GGA( TCGTAanTGGCTGAGTTCATGACCrACCTAGGTO 
TO TOA> ATOATGCGACyWJAAOACAC^CATGQTATGATTTrGCAGTTO 
CC AAG' CAAQCCAGTAGATTCCAGTCAATTTCCAGCCAAAGCTAAACGTCCGCTAAACTGAACGATGAGC 
Q1 GQC( aAAGCCAAAGCTACTGGATTTGTTATTCCAACTT^ 

A/ G AAf( JTGAGACTCG agcaccaccaccaocaccactga 

2CBE45 "homologue of SEQ. ID NO. 41"
AlGAAACGTTjCTeTCGACTCT^ 

OT GGTC GCTATCTATATAGCXXnTAGTCATGATrATCCC^ 
CG CGTG GATTGCCTroGGGCnTGTGATTGGTTTTGTCGTC 
TG ACCC [XTTTCTATATATTTTAGCCTrGGGACTTATG 
TT 5CAT :AACGGGTGCCAAAAACrCMjGTATClAATAAATX}GAATrAC(X^ 
Ga AGA3 ATCCTATATCCTCATGTTGGCTCGTGTCATTGTCC^^ 
CG ^fcc OTK^CTGGAClTITroTTAATT^ 
GC \CTT 3AAApTGACTTOGGGACGGCrrTCGTTTr^ 
GO YTTQ' IGGAAAATTATTATCCCAGTATrroTGACTGCnXiTAACAGGA 
Tp ITTA< iCAAGGACXSGACGAGCTTTTCTTCACCAGATTGGAATGCOT 
» GO :TTO 3CTCAATOX:rrrcAGTTTC^ 
GG 3AOT SGTGlGCnATTTTGTCAG 

GA rTp}- ACGGTTATTGCAGAAGATTTTGGCTrrATIGGCTCTGTC 



qa rriA< xxjtatgttgaagattactcttaaatcaaatmccag^ 

3 i tTGA" "GTTOCTCtTCCACATCTITGAOAATATCGGTGCTGTGACrG^ 
( TOtt ICTTTCATTTCGCAAGGGGGATC^GCTATTaTCAGTAATO 
AT( >Afar rACCAGACTAATCTAGCTGAAGAAAAGAGCGGAAA^ 
TT^AAAvAAATTAAACTCGAGCACCACCACCACCACCACTGA 

2C**46 "homologue of SEQ. ID NO. 42"
Aft KJGAJUAATGATGGOAATCACaGGGGGAATTCCCT 

AG< JAAG KJTTTCAAGCAGTGGATGCXGACGCAGTCGTCCACCAACrACAGAAAC 
TGJ X3GCF rrAGTA(^GCACTTTOOGCAAGAAATCA^ 
GC 'AGTC TGATCTTTTCAAATCCTGAAGAGCAAAAATGGTCTAA^ 
AG( »MC" [GGCTACTTTGAGAGAACAGTTGGCTCAGACAGAAGAGA^ 
TT1 rOAG CAGGACTACAGCGATTCGTTTGCTGAGACTrGGTTCGTCTATGTGGACCGA 
GAJ iGGC TAATGAAAaGGGACCAGTTGTCCAAAGATG AAG GTG AGTCTCGTCTGGCAGCCCAGTGGCCTT 
f AC AAA LAMGAAAGATTTGGCCAGCCAGGTnnT^ 
AA^TG^TATCCTTCTTO^ 
OA 



£CFEJ*7' "homologue of SEQ. ID NO. 43"
ATG AGAyiAAAtTOTTATC^TGGlGGAmCCACT^ 
GTG rCGT rGCOTAAfTCCAGCjrATrATCrTGGCTGAfGATGTCGTO^ 

CGC AljG? AGCC^GTCTTGTCOAAATCATGGAATTGATGGGAGCTACTGTTAAGCGTTATGACGATOTATT 
GGA dATT GACCCAAGAGGrGTtCAAAATATTCCAATGCCTTATGGTAAAATTAACA 
TACrATt TTATPGGAGCCTCTTAGGGCXSTTTKKjTGAAGCGACAGT^ 
TGG rCCTi :GTCCX}ATTGACTTACAC<^ 

GAT \ ACA IXjAApTTATCTGCTAMGATACAGGACTTCATGGT^^ 
TGGpAGC^C^TTMTACGATGATO^ 
i j ere rr (c°**°0

b XJTG VA(£IX?AGATTATTGATGrA^ 

A H*M rATCATCATTATTGATGGTGTTGAAAGATTACATGGGACACGTCATCAGGTGA 
T X3AA Kn^MCATATATATC^ 

GjIACA :CTG(5AAGGGTtraTTGCTAAGTTGGAAGAAATGGGAGTO^ 
' TOT GTCGAGGAACAGTCTAATTTGAA^ 
T TGC^CMCCXjCTTACCOT 

A LAAACGtGTAAATCATGTTTTTOAACTAGCAAAGATGGATGCG 
ATOTGTACACGGGTGGACGTGATTTACGTGGGGCCAGTGTTAAAG^^ 
rfcACVAGTCKTTC^GGOTA 
G( iXTA* TCTOATATTATCGAAAAATTACGTAAT 
A(Xaq<:ACCAOCACCACTGA - 

'CIEW "homologue of SEQ. ID NO. 44"
AT GIX^GAATTGAATTrrc 

rt tGaa tgataaagtaggatcttatc^tatc^ 

CI CCTT jGrTCATTCAAGAAGTTCAAAAAATTAGTC^ 
CC AAC< TTHX3GGTAGATCAAGTTCTCGATT^^ 
iC GTCl rGCTTnrrCGTTTGATTGAf AAAATTCATO 
A/ ACA,< ICTQTTICTACAATCTTrGCCTAC^ 
^ CC TTTT jCAGGACAACGCTTTTTGGAGTCTACCTTGTA^ 
% Altai TTATCACnACATCATTGAGATGGATGG 

CC AG& .CXJaGATAT^ATCTTATAGGTCGCAGTGGATTATTTGGTTTO 

cc gat/ lct<5nctagagattacgaagaaat 
j«:caccaccactga 

i "homologue of SEQ. ID NO. 45"
At 3 AGAlAATATGGCTtTGACAGCAGGTATCGTTC 
CA VfTA ^vAAjAGCAGGAGCAGAGGCAGCAAACTAOCCATTTC 
TG 3AAC rTCXlAGATGAACGCCTACAAAAACTAACTGAAATGATAACTCCTAAAAA 

ca rrra ^tttacagatattgcagggattgtaaaag^gc^ 

tc rro© ^aatattcgtgaagtagatgcgattgttcacgtagitcgtg \ 

GG XJAG ^AAGGACGTGAAGACGCCTTTGTAGATCCACtTGCAGATATTGA 
Tt<:rto(STGAd^AGAATCAGTGAACAAAC^ 
AT. iAAG lATC^GTAGCAGAATTCAATQTTCTrCAAA^ 
CTC IGt/ii :CATT?GAATTAACAGATGAGGAACAAAA 
AG ?TCn TATGTAGCTAATGTGGACGAGGATGTGGT^^ 
AY "CfeTX IAATTuPCKZAGCGACaGAAAA^ 

CTC tAATT t)GATCATGAAGATAAAAAAGAGTTIXjTTC . 
OT 4 GAC( ijOTTGCAGCTTACCACTTGCtTGG^ 
Gfc TGGACTTIICAAACGTGGTATGAAGGCTC 
G(p- TTA7 TCGTtJCAGTAACCATOTCATATGAAGATCTAGTGAAATA 
AC( rraGi iCGCTTGCGTGAAGAAGGAAAAGAATA^ 
AA' GTCC TCGAGCACCACCACCACCACCACTGA 



o 



TjCFriSO "homologue of SEQ. ID NO. 46"
ATC Gi^UTCGAAAAAAa^^ 

»TOAATTA™TCGAGCTCrACTACGCTOATGA 
TCGTCAGGCTX5TCTATX&^ 
GG^CATgTACTipGGACTATAtTGTCCGOlG 

^CiiG^AOCAGATAGAAATTTTAACMGCATTGATAATAGAGAACTCGAG 

ACdAi 
ICFJfcSl 



"homologue of SEQ. ID NO. 47"
AT^QCn1rAGMTG(WAA<2MT^^ 

GTG 3TAT rCGT^GCAATATXXJTAAGCAAAATAAGCATTCTCC^ 

^GCC vatt jAGaJjkjatcaaaga^aaaatggct 
\ fclcGAtATTGCTGQCrTACGTCTGATGGTreAGTTIGTAGATGACG^^
S AAG2GTCAG(^TATGCGAATO^ 

Jtcc ATCATGTGGTAGTAGAATATACGGTTGATaCCATCAATGGAGCTAAGACTAT^ 
^ Ti AAA'TreXJTACTTTGGC^TGAATTTCTGGGCAACGATAGAACATTCTCT^ 
jkX TTH e^GATGAGATTAAGAAGCGACTG^ 

i!ffii?AG^ 

9ri«52 "homologue of SEQ. ID NO. 48"
£ GGAiCnAATACACACAATGCTGAAATCTTGCJTCAGTGCA 
A CTO ^OAGATTCCCCTAGCAGGGCGTTCAAATGTTGGT 

CC GTA; qaATUIWJODOCTACATCAC^^ 
^ 0/ CAA< jaTCCJG^TTTGTGQATGTGCCTGGTTATGGCTATGCTCGTO 

fi^f^'GCATOATTGXGGAGTACTTA^ 

^^CCCGTCAGCAGATGATGTGCAGATGTACGAATITCTCAAGTATTATGAGArrc 

ATfrGTtQCGA^AAGGCGGACMGATTCCTCOTCGTAAATGGAACAAGCATGAATCAGCA^ 
1/ ATT/ AACTTTOACCCAAGTGACGATTTC^ 

CT rGGC ATGCAATCn-AOAAAAATTGGCCKJCCGCACTCOAGCAC^CCACCACCACCA 



<KG! 



^onfesj "homologue of SEQ. ID NO. 49"
A ft AAAACAAGAAAAATCCCTTTCCGCAAGTCTG^ 

2GCA tTOTGAAGAACAAGOAAGGAC^AGTCrTTATTGATCCTAC . 
C"T PATA rCAAACTAGACAATGCAGMGCCCTAGAGGCGAAAAAGAAGAAGGTCTTTAACCGCAGCTTTA 

CC«OeAAG1XK3AAGAAAG(OTrA^ 
AG ITGG SACTTOAACTCGAG^OCACCACCACCACCACTG A 

■2.CFSS4 "homologue of SEQ. ID NO. 50"
Afr }TTAAAACCCTCTArTGArACCTTGCTCGAC^GGTO 
AA iACC rGCCCACOAATTOOMGCAGGTCKXXX^GCAACTCAAGGTTT^ 

Cfi :Q cr TAGAAGAAATCGAATCAGGAAACOTTACAATTCACCCAGATCCAGAAGGAAAACOT<1*lAGCA 

gt< seen xsccgtatcgaagaagaAaaacgccgcaaagaagaagaagaaaagaaaatcaaagagcaaat 

TG< rTAA LaAAAAAGAAGATGGTOAAAAAATTGTCGAGCACCACCACCACCACCACTGA 



.a* 

vo A' 



^ 139' 



IS 



.cfki 

ATO* 
T<t<IG 

t<}>a< 



;5$ : "homologue of SEQ. ID NO. 51"
TCAtTAAtATCAAAACAAC^ 

:gggA iaaatggactcaacqaccaaatcaaaaccagcqtccgtcaagctct 
ago' tactctctta(^aaacagagatgaaaacatcca(x 
otqi ggatacagtccaaaaaataggacgcatcttg 
caa< f atttctmgaaagf caaagaaatcctcga 



"homologue of SEQ. ID NO. 52"

AtdtiCCttTTOAAAATTATA^ 

ag< CGdi xxjgJatgaaggctgttttgototatttc 
aac ceo xjagatoaagcaatxkjct^ 

AAC ACC/ AAAAaCGCGTTCAACGAOCAGTTGAG 
COT rCAd \TTX^TATTdACGGTOCTATGAAG 
CQ/\ CCAC CTCATGACAGATATACXJAGCaGCCCaCCGTGCAGGGAT^ 

gtc caac atgactcaatcaaaacgcagattaaccgaactc 

AA^ AGTi £G6ij|C^mCATATAAAAAA 

OC&il "homologue of SEQ. ID NO. 53"
ATC mCpAAAAAimAATTGGCAATCOTTC 
GGC GATI GGGACGGTACGCKnTTA^ 
$\ GCA 3TTT jTATTCGTCCTGGCAAGGCAACAGAGTCTTATCTC 
CTTfACT^GG&G 
Gl GAAC AAGTAGGTATCAAGTTTATCGGTCCATCTGGTC^TQTTATGGATATGATC
tg cacc tGCTCAGATGATTAAAGCAGGTGTGCCTGTTATACCAGGTTCAGATGGA^ 
QJ\ A&\J .GCTOTGATTGTTGCTGAAAAAATTGGCTATCCT^ 
Gl AAAC GGATTCOTAAiCKJnGAAAAACCAGATGACCTCGTrT 

CA KQGC CAATTATGGCAATGGTGCCATGTACATAGAACXSGGTTATCTATCCAGCTCGGCA 

CA AATC CTAGOTOATGAGCATGGACATOTGATTCACTTGGGTGAACGGGATTGTTCTCT^ 

AC CAAA AGGTTTCGAAAGAAAGTCCCTCGATTGCAATCGGAA^ 

CT 3iJTG rTCG£GCGGCAGAGTriX}TTC^ 

AA GTAC AAATrifTCTATT^^ 

TC iGGTSTTtiJVTATCOTTAAGG 

AT Vl^G rCCTACGCGGTCATGCCATCGAGTGTC^ 

AG TCCA GGTAiAGATTACTAATCTCTATCTGCCXAGTC 

AT ZCAQ STTAijArcATTCCGCCTTATTATG 

It [TOA< 'GCCiTGATGAAAATGCAACGTGCCCTCTATGAATTAGAGATTGA^ 
GA niTC ^AGCTTCATCTCATTTCAGATCGCA 

Aj\ CCTT nTACCTAAATATCAAGAAAAAGAACTCC^GCACGACCACCACCACCACTGA 

ZCFSSav "homologue of SEQ. ID NO. 54"
AT iXTTlTACAMGrnTITATCAAa 

AC TTkQ \(^TGGATGCCAGCTCAGAACTTCAGGGCCGTATCACTGCT 
CCi ^{3A 3TACAATATCGAGTATATCGAACTCTTGTCTOAO^ 
GCCTTDpAAATTACGGAGTTCCn^ 

i 



Ai 

o 

GA 
GGtTTG' 



cc/A< 



GA& 
GTG 



^CT&frV "homologue of SEQ. ID NO. 55"
AT( 3AAGDATAOA TATATmAGCATITCAGACATCCtGT^ 
!< jATG \GCTCTrGTCCAATGTCATTGCT^ 

LApT, VGCGAGTCGTC^CCATOTCGAGGTCATTACAGCeTGTATCGAGGAGGCA 

k irrkQ< :gaagaggacgtoacagctgttoc^^ 

rCAGCTGCCAAGGCCiTTGCTTC 
GG|LCCT< ^TGGCAGCTCAGAGTGTGGAGeCTTTGGAGTTTCCCrTGCTAGCG 

CAG li3TT€KmnATOmCTGA 
TG4GGA KrCTTATGACAAGGTCGGTCGTGTCATCGGCTTGACCT 
CTC GPTC ATCAGGGGCAGGATATTTATGATTTCCGC(1GTGCCATGaTTAAGGaAGATAA 
CCI TCTC AGOTTTGAAATCTGCCTrrATCAAT^^ 
TA( AQAu LGATrrGTGTGCTTCCTTCCAAGCAGCAGTTA 

GAGi AATATOCTCTTAAAACCCTAGTTGTGGCAGGTGGTGTGG^^ 
AGC^UCTGAAATCACAGATGTCAATGT™^ 
TATjGJVTIGCTIATGOCA^ 

GTiTIGCCfrTGATAC^^ 



"homologue of SEQ. ID NO. 56"
ATdTGTCjGAATTCTTCkrroTTCrrT 
TTC AATA CCGTGCknATGATTCTGCGGGAA^ 
GTipqit OTATpaCAGAATTGTCTGCCAAGACAGCTGGTGTTOAGG 

JGC AACI^TOGCUAACXAACXK^iQACAATC 
3T rCACJlLATGGGGfGLATTCAAAACTACCTT 

caWaggc ^caaAgagatacggaaatcgccgtacatttgattggaaaatt^ 

$^ 'AQfl TOTT 3AAGCC1TTA AAAAAGCrcrrCATATTATCWTGGlTC 
M^ATtC AGAT&TCATCTATOTAGCGAAAAACAAATCTCC^ 
"J rGCT^GATOCTATG^ 

k IXjGTpAAGGCroATAGCGTGGAAGTrcAAGACTATGATC 
VTA :TpC jGAACITGACrnGTCAGATAtCGGTAAGGGAA 
1GA 3CAA CCAACTGTTATGCGTAAaCTCATTCAAGCCT 
CCT 5CTA rCATflAA(K5CTGTTCAAGACGCA 

1 SATrraCT^CrAAGAAAATCTTGGAAGAATT^ 
KKjCrACG&TATGCCA 
C :dGA rAGTCGtCAAOTTTTGGTCAAGGCTAATGAAATGGGAATTCCAAQCTTAAC^
C IGGT rCAACCCTCTCACGTGAAGCCAACTATACC ., 
A J fCAA JTAAAGCCTATACAGCGCAAATCGCAGCCCTrGCCTTCCTTC 
Qi }TAA rGCTMAGCGCAAGCCTlrrGA(Xn , GGTTCATGAATTGT^ 
c -cm CAGXGAAAGAAACCATTGAAGCCAAGGTTCGTGAACTTOT 
Q lTCG( UCGTGGTCAAGATTACTACGTAGCCAT^ 
Ci iGTG' YjAA^GTTTTGCGGCAGGAGAACTCAAGCACGGAACCATTGCCtro 
G' «1T< GCTdTCITOTCAGATCCAGTCCTT 

XJGT< iCTAAGGTXXnCACTATCGCAGAAGAAAATGTTGCTAAAGATA 
IX ITAO .CCCTTACCTCTCA^ 

A( icgtggcctcgatgtggataMcca^ 

ACSCACCACGVCTGA 

2CIE$l "homologue of SEQ. ID NO. 57"
kl G ArAcXnATCGAAAATCTGA<^ 

TC catxgaccaacaattaccggcatc^^ 

aC TfJGC rAATTATOCCACATCAAGGTCAGGCATrrc 
Al TQ(X TATGTCOAAGVAAAA ATGAATATCGACTAC^ 
TA GGAC TATrtGCCTCTATC 

OC TTGA AATCGTCGGCCTAGCTGACTACGCTGAACGTCAA 
CG GGTC TTGAlTGCCAGATOTTTGOTGCAGGAAGCCGACTATATCCTC 
TG ACTC FGTCAGTGAGGAAATCATCATGAATACGCTGAiGAOATTTGAAAAAAGCT 
CATCGTrcACCACGACCTCA 

A1 TGCCrTIXKpTCGAACAAAAGAAACTmACCGAAACCAAT^ 
TtT tTCA ITGG^GGTGACCtACTCGAGCAC^ 

I "homologue of SEQ. ID NO. 58"
AT }CCGj\AAGAAGTOAATTTAACAGG CGAAGAAG GAAOAG 
OA rGTT ^ATTTTGTCCATAAGGCCTTGGiCTATGCTGTTC 
CG <GCC C^ATATCATTCACCXJrATCCAAGTGGCAGGTATT^ 
GT \GCT fGTG0ATTCTIXK^TGATGTX5GTGGAAGATACAGATG 
TT< 5GTC( rrGATGTGCGGATOATTOTrOA^^ 

GP \QCA ^TTAGCGGAAAATCATCGC^GATGCTCATGGCCATGTCTGAGGaCATCCGCG 

AA \CTG rCTGACCGCTTG<^CMATA 

Ct- lAAG iAACCATGOAAATCTATGCCTC 

ACi VAGA STTOtCTrTCCGTTATCTCAATCCAA^^ 

*CQ< :aGG 1 AGCGfCUGGCCTTGGTGG ATGAGGTAGTX^CAAAATTAGAGGAGtATAC6ACAOAA.(K>TCAC 
TTC AAA! IGGAAOATTTATGOTCGTCCXAAGCATATTTACTCAATTTO 
AA< :OGT TGAGOAAATCTATGATCTGATTGCTATTCGTTGTATTTTA 
VTC fCTKjGTTACGTGCATGAATnTGGAA 



aotccvUtggttatcagtct^ 

AA< !CAA< KjAAATGCACGAGGTGGCTOAGTACGGGGTTXjCGGCTCACTGGGCTTATAAG 

GGC JGCA IGTTAACAGCAAGGAATCaGCTATTCGAATGAACTGGAT^ 

CC/ GGtf GATQATGCTAACKiAATITOTGGACTCTG^ 

TTI \CCC CAGAtGGAtieTCTCGGTTOCCTTC 

TAC CAAC GlCGGTT3AAAAAGCAACTGGTCfOCAAGGrc 

AA> GAO LGGGGA7CAGGTTGAAATTATC0iX 

ATC GTQA AGACTAGCAAGGCGCGCAATAAGATTCGCGAGTTXnTTAAAAAC^ 
GTC AAC/ AGGC^GTGAGATGCTGATGaCTCAGTTDlAAGAA 
AGA AGCC CCAWTGGATCAAGTIXnXX^UAAG 

tgg rnr JdcGjUATcckiTGCdAmcoGicm 

CCt 3CCA \GGCCMGGCTGAGGCAGAGGAGGTTCTCA^ 
AAC TCTC UMTCAAGCATOAGGCKJGGAGTGGTTATTO 
AAC TGTT jTAACCCCGTGCCTXKrTGACGAM 
ACC 3TGT 3GACjlGTATGAACCTGCGTGCECAAGAAAACT^ 
AG* CCAG TACTCTAGCTCAAATAAGGAGTATATGGCCCATATM 
<flTQTIfcAAclcUTGTACIGC^
&AACQATAWAGTTKKnAATATCCATC^ 

rAAAATTAAGAOTGTGCCAGAAGmACTCTGTCAAACGGACCAACGGCCTCGAGCACCACCACG 

CACTtjA 



crca 



ACGAi 



akj; 
jta! 



2 CFiSi 



$ ATT* 



lOAjI 



"homologue of SEQ. ID NO. 60"
{JAAGAAATCAAAAATCTGCAGGCACAGG^ 
uGCTQTTCGTATGCGTCCAGGGATGTA^ 
jAAASTOTTGATAACTCAATTGACGaGGCCTTG 
OKGC^^GATuATTCGATTACTOTTGTGGATGATGGGCGTG 

\ }CTGCTGTTGAGaCCOTCTTTACAGTCCTTCACGCTGGAGG 
GbTTfl JAGGTOGTCnX^CGGGGTGGGGTCGTCAGTTGTTAATGCCCTTO 

t< :ca^ j\aaacggtaagattcattaccaaglaataccgtcgtg 

cj ^tacqgaraaaacaggaacaactgtrcacttcacaccggacccaaaaaxc^ 
;ttitgataaattaaataaacggat^ 

cfATCiicrGATMGCGCCAAGGrn'G 

cx rrtdJ atatatcaacgagaacaaggatctaatctttgat^ 

QJ kTAT( lAC^GTTQAGiGTAGCCATGCAATACACAACGGGTT 
A: TlfTC ATACACAlGAAGGTCHjAACGCATGAACAAGGTrrc 

n atgIc tcgtaagaataagttactcaaagacaatqaagacaatctaac^ 

Ci TAAC TGCAGTTATCTCAGTTAAACACCCAAATTC 

a> tag!( igaagtggtcaagattaccaatcgcctcttcagtgaa^ 
c/ cad ,ttoccamcgtatxx2tagaaaaaggaattm 

OC}GTG> AGT^CAOn'AAAAAATCTGGTrTGGAAAm 
*l AACCCTGCTGAAACAGAACTCTT^^ 

^ ACCGTQAGTTTCAGOCTATCC^^ \ 
Q^ATAJ GATTCtAGCTAACGAAGAAATTCGT^^ 



TTfcGTA :CC'mnnTTAACCrrOATTTATCGTTATATC 
CC 2AA$ CACCAATCTATGGTGTCAAGGTrTG^ 
AA Q AA/ TCAAACTCCAAGAAGCTrTAGCeCGTTATA^ 
TA \Gbt GCTAGGTCAAATGGACGATC^TClAGCTGTGGGAAAGAACCATGGA 
CG STAG ^GTTTCTOTAGATGATGCrGCAGAAGCAGAtAAAATGTTTOAT^ 
GA 3CCfr XSTCGTGAGTTTATCGAAGAAXaTGCTGTC^ 
AC2ACGACTGA 



"homologue of SEQ. ID NO. 61"
ATfeGGAlTITACTGAAGAAACAGTACOT^ 

kCAO ^TGTfTATtknTCGtTtjAACfGATAAGG 
GA JtpG IGACCCTGCCTACOTOXn^^ 
GA UlCi JTtGAGGAATTGGTKXK^ACTATCTCAA^ 

Aa&c^cTG^ ...... . : 

& C$166 "homologue of SEQ. ID NO. 62"
ATqatC^CfATCCAG^TAAAGTOCATCA 

r ™ -GGAATOTCITrTGAAAAGATGATCAATGCTAC^ 
TATA UTAAGAAACC^CTCOTATTCA^ 
GTT^CteCTAtTtTCGACAAGCT^ 

TCAAGGAAACAAAACAAAAACGTGCGATTGCGATO 
CAljATp^CAAGTCOT 
CTA CTTA riXKXpGCATK^TO 

AAT fVTAT rcGAJSMTATGGATATGAAATCAAGGCIGGTGCCI^^ 
AAA GAAj< fATTIHmGSTW 

2CFEJ7 "homologue of SEQ. ID NO. 63"

II . I • r * " ' ' ' 



citrraAAG* 
^ StSgWcGTATCTGTCGGCACTC^^
S^lSn^TATOGGAACCT^^ 

^ CC See OTAAA^CATTCAAATG^ 

^ S TCA SGCTrcmGCTATGQAAGGCGATATCATrCTGGCT^ 
Cfc CGT^TOTGATOAAAATACGGT^^ 

ArinC^XKJACKKJAOTOT^ 

SffilAGGG<^AAATCATCCAGATAAATC 
AAl'^'TCA^OTACAAGGTGaAACGATTTGTAGTGG 
AC( X3CG( :GTAGAAGCrAGTCAAGCCTACGaAGCAAAGCTAGAAGAG 
CK( JGAT< } AACGCHTC ACAACAGTGGCTOCTGAGCGCATQTTGA 

AA< 3CGC u^OAAAGTCATTOATAAGTTAXJ CAGCTCAGCTGATlTrACAAAATTAT^ . 
TCC AGCiiCCACCACCACCACCACTGA 

2 CFI!fi9 "homologue of SEQ. ID NO. 65"
at^ai 

TA< 
CA 
GG 



■ ^ 



AGlAi 



TTA 



£ ATGlGAi 



.^^gaaagakkk:gtaatcactaacgataaccgtatcacagca 

CA?CGA> CAAGGTGGACGTGCAATTCTTTTClCrc^ 
GG' 'AAA' CACTTGCTCCTOf AGCAGCtXJACnt^ 
CA< TcS GGTCCTCAATTGGAAGOGGCAATX^^ 
AC] CGT3 XCGAAGATGrrOACGGCAAdAAAOAATCTAAA^ 

GATdQTATCTTXXn'AAACGATrGCATTCGGTACAGCTCAC^ 
JCAAACGTTOAAAAAGCAGTrGCTGGTTTCCT^ 
A^bHoAMCTCCAGAACG^ 

GT1 ATCC AAAACTTGCTTGAAAAAGCTGATAAAGTCXTTATCGG 

AAC fCAd lAGC^ATOGAAATCGGTAACTCACTTG 

TQA AAA> tGCAAATGGTAAATTGATCTTWCAGTTGACTC 

GA/ gtcogtgacactgaaggtx^ 

TCC CCA* ATTTGACGAAGCTTTGACTGGTGCCAAAACAGTrGTATG . 
AA/ CCti GATTTCCAAGCTOGTACAATCGGTGTGATGGACGCT 



5GC GTCCnXWAOCAtCAAWC^ 
& CGA(K^CCACCACCACCACCACTGA 



... "homologue of SEQ. ID NO. 66"

atc ttaaIaatcaga^aacaatcacgttatcaaatgt^ 
aaa cca/ tgttttggctaatctttccaacgcc 

ogci tttatttgtixxjatggaaaggaattggttrragg 

TAT rqCA zrAGGCAAG^TXntTGTGG^ 
AOG ACC7 ATGTXtAACTATATTTCrTOTC 
ATG 3TCA Gmtrn^AGTTCrTCGA 
[TTG( iAACjlfcATITOlCGCT^ 
CTCG> IGCACCACCACCACCACCACTGA 



"homologue of SEQ. ID NO. 67"
lAkCTATTOAC^ 
AAACAAGTIGGbATTTTAGGGGGCSAATtW 
2 ere 7/ {e^J

TXCGG<tAACAGTTGGGACTGGATCAAGTTCTTCTC^^ 



V K ft< !AAA ^TCCCTGAACACCATCQTCTCAAGATGCTTGAGTO^ 
^ A TQAJ ScATOAG^GAGCGCAAGGGTATTTCCTACACCTACGATACCATGAAGATTTO 
> V" ■ 4 a ■Vy t A OATA rfiGATTATTACTTTATC ATCGGTGCCG AC ATG GTTG ACTATCTGGCTAAGTGGTaCCG 

W QCA<CAC(?AeCACCACCACTQA 

O. (it vn "homologue of SEQ. ID NO. 68"

^1 fi i ATtATTCCAAAAATAGTCAQ^ 



TCAI 

cao 



^ AA« 



1A TTGd rTGCbTTGGAGACCAAGCTCTAACAGTGGTTGGTATCCAAAA^ 
<£ fdAA <S^T^GCCAACCACATO^G AAGGCTACCQAAAGOCACTGCM 
7G AGA/ AmCGCCGTCCAGTTGTGACCTTTATCAATACAGCAGGTGCTrATCCTCGTGTCGG AGC^ 
OA ACCT GGTCAAGGGGAAGCTATCGCTCGCAATCT^ 

CC ATTA rrATCGCTGAAGGTGCTTCAGGCGGGGCTCTGGCTCTAGCTGTCGCXjGACCGTGTCTGGATGCT 
^ OG aaa> TTCTATCTATGCCATTCTCAOTCCACWAGGCT^ 
£ CC \TGG ^AGCAGCAGAACTOATGAAAATCACrrCGCATGAACTGTTAGAAATGGACGTGGTGGATAAGQ 

TO ^roMGTAGGACTrTCTAGTAAAC^ 
CG MJCA CCAa^OCACCACCACTGA . 

1 ■ * 

(LCB^lS "homologue of SEQ. ID NO. 71"

Ar JTcAbATAAGATTGGGTrATTCACAGGClX^TTTGATCCC 

AA Soc CdA^^^liTTTTTG ATAAG CTrTATGTGGGTATTTTrTTTAATCCCGACAAACAAG G ATTTCTT 
'A1C( JAAAXTCGlAAACGGGGtoClVVGAAAAGGCTrrG^KJA 
; CAT< jATGAATTGGTGGTCGATOTTGCAA^AA 
H> All JCGT :GGATTrGCAATATCMAGCCAGrmGATOCTACAATCATCAGCTOTCTTCTO 
A£ A r TA'Fl TaCATAGTCGACCTCAACATCTCTA^ 

GGA rATTGCCTGCTATGTTCCCGAGAGTATTrGGAGGAAAGCGGCGGCACTCG^ 



CCACXJA 



"homologue of SEQ. ID NO. 72"

ATiACGATTfrGTrrcyrGGTTAT^^ 

r, joIAGTGAAGGAGAAAAATTAGCTCAGCAGTATGCAGGATTAGAGCAGGCTGaTCAGGTTGAT. 

I ATGQCTTGGAATCTTATTACAGCGTTCTTGGTCGTAAT^ 
„.J 'AAAGATGACCATAAOATTTACXiTrTATCAGCTAAATCAGGGTATTTCACAAGAAA^ . 
™ ioGT TCTAACX3AAAAGGGAGCTCKX:CJAGATTGACAAGAT^ 
AA' rTG< laTAGTTAAGTCAGGATCrGATnTT^^ 
GA< KJGd HVlClteGAGCACCACC^CCACCACCACTGA ' 



ACaG 
.TO TA' 



OA' TOG' 



"homologue of SEQ. ID NO. 74"

ATC TOIvtfCAATOTATAAAGAAAAA^ 
CG( CAGC lATATTCrrACTQQAAATCAQTGTTT^ 
GGA AATC TrGGTAGCCATCATTrraATAAGmCATCTACCXL\ATO 
TCA GCA/ (KJTAAACCACmAGTaTrcaTTATATCAAGa^ 
TAA CGG1 AAAT«SCTCTrroACGOtGT^rGGTrcGGAGCT^ 
CAC TGA1 TAAOTroGTTATCGGTGTrrrXGTCGGTGGTATTR^^ 
. ATc aToc aagtotacaacgtcatctcaaacatcccacc; IUI 1 1 lUATTQTTATTGTCTTGA^CTTACTCAAT 
H. CGC AGCl GGATTCTGGAATCTOATTTTTCCCATGAGCGTA^ 
is QTG KSCA AATtftTGCGCt ATCGTGACTTC 

gaUoatiottoccaaamtatcatgc^^ 

C AA 3CTT f ATCTCaTACO AAGCCTTCr TOTCf 1 i CTrCGGTCTTGGATTAGCGATTACAGTGCCAAGTTTG 
GGTDGTf rQArfTCGGATTATTCACAAAACGTAACAACC^



^ ^Wr TGT( rnGGTATCCTTGTCCCTTTTCG^ 
^ ATAGA<?TCGACpACCACCACCACCACCACTGA 

"homologue of SEQ. ID NO. 75"
0- ATaTAtUcCTATTATTAACC^ 
^ AC< JAAA \ACCAATCCAGCAATGTATTrGATGCCAGTTC 
% Or if QJ AGCrGTAATGCAGCGTrTGACAGGGATTTTAGTCTT^^ 
AO iGJij [TATCAAGTAGACTCGAGCACCACCACCACCACCACTGA 
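
Several construct sequences in this figure end with the motif CTCGAGCACCACCACCACCACCACTGA. Read as codons, CAC encodes histidine and TGA is a stop codon, so the tail is consistent with six histidine codons followed by a stop, and CTCGAG is the recognition sequence of XhoI. The following is a minimal sketch for flagging that tail, assuming plain DNA text as input; the function name `has_his6_tail` is hypothetical and is not taken from this application.

```python
HIS6_STOP = "CAC" * 6 + "TGA"   # six histidine codons followed by a stop codon
XHOI_SITE = "CTCGAG"            # XhoI recognition sequence


def has_his6_tail(dna):
    """Return (tagged, xhoi): whether the sequence ends with the His6-stop
    tail, and whether an XhoI site sits immediately upstream of that tail."""
    dna = dna.upper().replace(" ", "")
    tagged = dna.endswith(HIS6_STOP)
    xhoi = tagged and dna[: -len(HIS6_STOP)].endswith(XHOI_SITE)
    return tagged, xhoi


# Tail of one construct as it appears in the figure text.
print(has_his6_tail("TATCAAGTAGACTCGAGCACCACCACCACCACCACTGA"))
```

Such a check only inspects the last bases of a construct, so it can be applied even where the upstream part of a scanned sequence is illegible.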

"homologue of SEQ. ID NO. 76"
AT( ITTfdaTAGAAATAAATTATnrm^ 
AG LCAGATGGGATCTTTGATTAACC^ 
G& )CVt CTTTAmTTTGACAAAC^ 

TG( \TA% TTM^ACCTTGTGTACTTTGGTCTGGGGAATGGTCATAGG 
GA' TAA' 'CAGTTATCTAGTTTGATTATATCTAGTCAAACTATTTATAGTCGA^ 
AC' TfTT TAATTATCCTGCGCTCCAGAATTrGGATGTAGAAGCTACAArr^ 

cr uct .ttcttcaaaatatcctaaatagcgtatcaaatAgtg 

TGTT3 TTGAtTTTXJATTATGACTCCAGTTTTTT^ 
jk ATX iCTIX AAA(fAACGATTCTAAAGAGGGATCGCTTGC^^ 
in CGfJ iTTG(TC^TATATTA0TOQAGTrTC GATTG 
v AG' *ATTV TTPGGTTTAAAATATGCrrTAGriTr^ 
. GG< SCCA LGTATTGGTTTGATTCCTATGATCATCGC^ 
. AG1 X3A*] TATATOCTTGTTGTTCAGCAGGTAGATGGCAATATCTTATATC 
TO J AbG TCATCCAATCACGATmAGTTTTAeTrTTGT^ 
ATI GTCC CAGTGCCAACCAATtXnATCITGAAAGAAATTTCTA^ 
TAJ AATj ATGi^GMCGAGAAAGAGAATTAGCTAAGCTCGAteACCACCA 

"homologue of SEQ. ID NO. 77"

at< tatcJaagcUctttatcgaaaatat^^ 

CJ/ AGA( rrCTTjAAAGAAGCGGTGGAGCAAGAGA^^ 
MC IGGGi LAAAACCAGTGTTGCTAAAATCTTTGCCAAGGCTATXJ 
ACC TTOC AATAACTQCTATATTTGTCAAGCAGTGACGGACGGTAGTT^ 

gc/ Girn ctaataaixsgggtagatgaaattcgcgaaa 

CJ|C GfTA FAAGGmATATCATAGATCAGGTT^ 
ACC CTGjC 'AAOAAGCAACACA GAATG TAGTCrhrTATT^ 
C^A TTCt VTCCCtoTGTCkSAACGTTrrGAGTTTAAA 

PAT ATCT TAGAiMAAGAAAATATCAOTTCTGAACCAGAGGCTCTGGAAATCATTGC 
GflTGGAATOCGGdACGGCTTGTt^^ 

CTC CTAT ^Xn^AAGAAAITACTGGCACGATTAGCCTACCAC^ 
^ CAdCAGCATGTTCCCAAAGCTTTO 
l£ TOT3ACCGATCnTKK^CTATTt 

AG! TGAC TCnTtGTAGAAAATTTCGCACTTCCTC 

G AC CK^^TATTAAGTCTAGTTT^ 

GA^ ATCJa AGTCCGMCCAGCTCTATX^GGAGCGGTTGAAAATGA 

GGC CQf € ICAAACAAGAGCTTTlXn'AATGTAGGTGCGGTTC^ 

CA|G CTAC CKK3CAAAACAGTCTATCGTGTCG ATCGCAATAAAOTGCAATCTATCTTACAAGAGGCCGTCGA 
. AAA TCCT SaTTTAGCACGTCaAAATTTAATTX^GTTTGC^ 
GGTGGGjXGG^j^GCOTGCCT^ 

"homologue of SEQ. ID NO. 78"
ATC rTTC< jATT^CCMtAAGTTAGCGGTATC 
Q<X^QCT2TT(^CtTGGCAQ 

AAT XGT 3GAG6AAC^CCATTCAGGCTACACTTCGATTTTO 
ATTATCO TCTCTATGCCAATAGTTITGTCATGAAGAACCGT 
^ GGG CTTG WGAAGCGTCATCTTATGAGTaTQA 

GAGCQGC TAT^GTATTGGAGCJCTTGTTTGACAAGTTAATm
AAiGTTfjjAGCtGGTTGCTACCTTCCAGACGAAAGTTGTC^

CC1 AQGi :CT^TGTTCCTGAATGCCCTTCGAATCGCCCGTATGAATGCCCTCCAGCTC^ 
CIV GT<4< rAGAGAMAAAGGTCGCTTCCrrCCTXH^ 
CT/ ,TTAj' CTTGCCCTTACGGTAAAAGATCCTCTTACAGCCTTAAC^^ 
TA1 CtTJI GGGACTTATCTCTTGTTTAATGCAaaOATTAGCGTTTTCCrC 
AA' Apt LTtAuCAAGCAAATAACCTCATATCTCTTTCTAACTTGA 

GGi ictam k:caccatcgctatcttgtcaacaatggttttgct^ 
Ar cctcaqaaagctttaaaaaagttctaaatcctcatgattitgggqtttcagggga^ 

AGi IAGA iHTGGACAAACTCTrGAGCEAGTTTGCAAGTGACAAAGGTT^ 
An TCGjlTACACTTACTTTGCTGTTGCGAATCAAGAAGGAACCAAGrTAACTATT^ 
AA<XX5T[9T<MJaACCAAAAACAGTTTTCATGGTATTTGAGCAAA 
aa( nraxi ITCTATCAGGAAATGaGGTCGGACTCTTTGCAAAGAAT^SAGGGAGTTAAAGA^ 
tkl .CTCjl AAATGATCATCAATTrrCTGf AAAAGAAGAATTTACTAAAGA 
- : ' CAu' TTAATATrTTOACTG£TGATTACAAT^ 

: CCA( ATTCGGCTATCTATMTCAGTITrACGGTGGTATGAATGTAAATGCCAGTGAAGCAGAACAAC 
AGqCGCTGAGGAaXATGAAAAATACTTACAAAAOTTTAATGCTCAATTA^ 
TGI GTAp GGTA,GCACTCTAGCAGATGCTAGTGCTCAGATGAGTGCCCTCTTnW 
QV Ttrt CTTATTCCATTATCTnATGGTCGGAACCGTrCTGGTCATCTACTAC^ 

TA1 QAM iACCGTGAGCGCTTTATTATCTTGCAGAAAGTCGGTTTAGArcAAAAGCAAATC^AGCAAAC^ 
TCW ACa/ ACAGGTTTTAACTGTATTCTTCCTTCCTTTGCT^ 

AT^ TGCh TAGTCTGATTrrAAAAGTGATTGGTGTACTGaATACG^CTATGATGrTGATTGTGACCTTaTCT 
At C TGCf: CTATjCrrcCrCATCGCCTATGTGCTGATTTTCATGATTAOT 
GO AATj(Xnt!;GAGCAC^ACCACCACCACCACTGA 

"homologue of SEQ. ID NO. 79"

^AAGATCAACTAAAGGCTTGGCAACCAGCTCAGTTrGACCGTTTTGTC 
CAATCACGCCTATXrrCTTTTGAGGTTTCTTTGGAAGCTTGGAAA 
» j JTrcTACfcGATAAAGTTGGCGTCTTAC^^ 

AO^cjjiGrripccAaATaTcvccTTGArrAACK^ 

AA1TGG1 GCXJACAGTTTIXnTCAAGCAGCKJAT^AAAGCCAGCAA 
AlU AAAT XK^TCCCAAC^G<X^TTCT^ 

.TTJT 3TTC nrGACTAGCOATGA<KJMAAQATGTTACOCUCAATCCq^ 

^ TTACK^AAGTTTAGI^TCGCWAGCTGAAGCAGAAAACrriXk^ 

^ ^GTOAAGGCCTGCTGACTTGGTrAGTAGCTAAGAAAAAAGAA^ 
AM TAGf CAACrTGGCAGATGATAAGGAAAAACAGGATX^GGTTTTArc 
GtiA GGAK CTCTTGCAGGTAAGAGTAAGAGTOATTCfACAAGAT^ 

cSc CAcf ^° <mCA ^ TCC ^ TCGAA ? A ^ 

"homologue of SEQ. ID NO. 80"
ATGA^TTjCAm^VAAAATT^ 

cqc Tjrk rrrrrTTGGACJeAATGTTCGCGTA^ 

ATC^[TOTTOTTAA<3CACCTO^ATTOAcaJr^ 
XAA QOAt ATCGTCAAGOSCGTGATTGGAATCanXKJCGACACCATO 
ATC^ACAAA<3AAACG«AC{UGCOT^ 

TTC "CfclJ< ^A<§ATGACCGCITGOTTTtXiAGCGACAGGCGCCAGGTAGG 

^^G^GCTAAArrccajn^ 



o 

V 



c4 2,'GFEIS 
^ ATG3rA 
GAAP 



"homologue of SEQ. ID NO. 81"
AbbcCAkGTGGATATTCAAGCGATTAGTGAAACGACTGTTGTCAAAG^^
CG' riCC JMAAAAATCAATGATITGAACGAGCCTGrc 

CA' DGTG }TTGArCKrrATTAAAAAAATAGAGGAAGAAGGTCAAGGTATTTCTGATO 
ATOTTA^CATOAMQA^ 

TT( 'AAA fCGAGGAAGCGATGAGGGAAGAAGCAGGCXjCTGATGACCTO 
AA \GTC VAGAACTAGAAGACTTGGGCTTGAAAGTTGAAACGAACTTTGATAI^ 
AA 3TAA rGGCl^ATGTTCAAACGArTATTC^ 
<X( rtkQ' iAGCATCAATCTACSAAATTGACACCAA^ 
TKiAAG }CCTTGCAACTGTtGGCn^AAAATTATCTTTACAACCGCT 



CA fllOTOAATt^TTATC^ 

or TTdC lAAGAAGGGCGCACKIX^TAAAACAGATCCAATGTCAAATAGCGAAC 




\ CACfi 



9 2>CFpl8 

* ATp f 



gtcaO 



Ar 'ATT CAGGTATGGATGGCGTGACTAGTTACTCTGA AGGTGATGAGarAAATCGCTATGTTGTTGTAG 
AricAOAACTflGAGCACCACCACCACCACCACTGA 

"homologue of SEQ. ID NO. 82"

Xattitgccattattttagcagcgggtaaaggg^ 
x}ttqcgggtatnrctatgttggaacatgtttt^ 
jcagttgtaggacacaaggcagaattggttoagg 

fcTQAACAGTTGGGAACTGGTC^^ 
ICTT&TCATTGCAGGAGATACTCCTITAATCACTGGTO 
CcArATC AATGATAAAAATGTGGCCACTATCTTOACTGCTGA^ 
ATI GTTC GTAATQAGAATGCTGAGGTTCTrCGTATTGTTGAGCAGAAGG ATG^ 

PlL iTCAj kGGAXATCMCACTGGAACATACGTCTrTGACAACGAGCG i I lXJriTCAGGCTTTGAAAAATAT 
CL ,TAC( AATAACGCTCAAGGCGAATACTAt AmCAGACGTCATTGGTATTTTTCCGTGAAACTGGTGAA 
AMGTtX tGGGcrtATACrTTGAAAGATITTGATGAAAGTCn^ 

CTG> lGTCAGTTATGCGTCGTCGCATCAATCATAAACAGATC 
AGAAGtt LACTTATATCGATATTGATGTTGAGATTGCTCCGGAAGTTCAA^ 
AJL lGGS ^AACGAAAAXTGGTGCnrGAaACTGTnTG^ 
GOV ,GCA< kJAGCGGTCATTACCAATTCTATOATTGAGGAAAGTAGTC 
CCI TATC CTCACATTCGTCCAAATTCAAGTCTC 
AG< f ATGT TCAATCCGTGAGAATACCAAGGCTGGTCATTTGACTT^^ 
A*( IQTT> ATTT|CGGTGCTGGAACTATTACAGTCAACTATG 

C f A CA^ LTGTCrrTGTTGGTTdAAATTCAACCATTATTGCACCAOT 
h GCTC GTTOAACTATTACTAAAGACGTGCCAG^ 
ATA AACji iCGAATATiTCAACACGTCrrCC^ 

CTCA ' ■* 
2.CFE87 "homologue of SEQ. ID NO. 83"

atc tccaIagatJtctagtat^ 

cta cctt scaaaagaagcttacxigtitgga™ 

GCC TTTG rctTGAACTATTTTGGTXnXKjAAG 
AGC AAG1 TAICTTGaCTGACCACAATGAATTCCS^ 
CQC TOTT STAGEACCACCAOCGTX>1nGGCTAACrn^ 

GTT IGAl caqcgtcttcaatcgtttaccq^ 

CAC Gltt 3ATGCTTrCAGGTTTGATTTCAGATACCC 1 1UI IIIOAAATCACCAaCAACACACCCAACAGAT. 

aa^ atc/ rrocmxrrGAAtTGGcrGA^ 

AAC CTGC TACGAACfTGGCTAGCAAATCTGGTC 
CAA C6G> AATAATGTCCCTGTTGCCCAAGTGAACACAGTTGA^ 
GAAATOAAGCTGCAATGCAAGCTGCCAACG/^ . 
ATA rCpt ZMCTCAAACTCAGAAATATTGGCTCTIXK^ 
TTT< 'AAA nTGAAAAC^TCATGCCTTCCTTGCT 



TTGA \AGQI^AATACGCTCGAGCACCACCACCACCACCACTGA 



I- 



"homologue of SEQ. ID NO. 84"
VTTTCAAAfaAGATTAGAATTC . 
CATOCTTATC^
GOiGGAkGGTCCCTATCAGTCTGCGGTTAAAAATGTTGAGGCTGACGGCCTA
CC< \tm GCCAATGGCTtGGCAGCTTTTGAAGAGACTGACCAAGTGTCTO 






Vj cQ' JOTTTOATTGCTACKIATrtTAGAAGAAGGTTT^ 
\o TCC AGC( ICAATMTCGTGAAGACGACTTGCGTATCTGGCTACAG 
\ AA( jAAT ^TTAGAAGAAGCTGGAAAGTTrTATGAGATTTTGGTGGTGGAAGCA^ 
. - ag( "dkG 1 rGATGTTCGCTTTGGTCGCTTCTTGTCCAAAGAAGTGAGTC 

^ ail :ga A -jctgagaagctagaGttcgccctcggac^ 

CT7 GTA^ [ATA AGATTCAAGCTATCAAGGAGGTCCTCCATGTTAGCAAG 
ACVGA 



o 



"homologue of SEQ. ID NO. 85"

fAATfTA^CGATATTAAAGACTTGATGACT 
iATG' K3ACGGATGAGtrGCAGmAGCAAGAATCAAGCAAGACCTO 
TOjcTCC AGCACCCGTTGTAGCAaCaCCGAGTC^ 
AOJ lAGA VGTTaCAGCTCCAGCTGMGCAAGTGTCGCTACTGAGGGAAA 
qqj lOT d< JTTTACTIXjGCTGCTGGACCAGATAAACCTGCCTTCGTTAC^ 
GTC AAA< )AnGGTAATTATCGAAGC(^TGAAAGTCATOAATGAAATCCCAGGTGCT 
AA( :Gj5A UVTTCIXOTCTCTAA 
AA^b^a ^OCACCAOCACTGA 



ATC 



2.CFE91 



AGq 

\ TAT 



"homologue of SEQ. ID NO. 86"
^AAATCXjAGTAGTGGTAACAGGTTATGGAGTAACATCTCCAATC^ 

_ ja atag'tttagcaactgggaaaatcgG 

At3"tGC*XAATCCGGC^^ 

TTT TGAt \ACTATTCtTTATATGCCTTGTATGCAGCCCAAGAGGCTGTAAACC^ 
A<k '<CtCl TAATAGOOATCXnrrtrGGTGm 
TCA GGrt CTTCraCCTTCATGAAAAAGGACCCAAACGTGTC^ 
AA1 AfGC CTTCTGGGAATGTACXXATGCGTTTrGGTGCA^ 
CIC ITCA rcAAATGATGCGATTGGGGATGCCTTCXX3Crc^ 
TGC GAG< SAACAGAAGCTTCTATCACACinTTTGCCATC 
iTCGAACTCGXGCTteGATCCCAT^ 
^TTdGTICTAGAAAGTCtTGAACACGCT^ 

TG(3tTa&GAJWTA^ 

GCC ATCA AACTAGCCTTGGAAGAAGCTGAGATTTCTCCAGAGC^ 

Ga^TGAAAAAGGAGAMGTGGTGCTATCOTAGC^ 
iAGTGirrrACAGGACATTTGCTGGGGGCTGCGGGTGC^ 
jt^TAACTirCrrACCAATGACAGCTGG 
JSsfeATOGACAAGGTTraGAGAAAGAAATT 
CAcIaATC CAGTTUTTGCTtrcAAACGTTGiG 



CCSflGAAqTCCTI 



^AljC^ACXmi 

*^gto4tgcg* 

TGTOG1 



"homologue of SEQ. ID NO. 87"
ATSMCWTCTATGATCAACrACAAGCTC 
CCT 3 ATG I^GTTTCAGACACCAAGCGTTTTATGGAGCm 
TAA TAGt CTACpOTGAGTATAAAGVAGTCCTTCAAAATATCGTC^ 

GAT<iGGGACTTGGAAGAAATGGCCAAGCAAGAACTCAAAGATG<K 
A AT ATOA AGAAAAACTOAAAATlTroCTXXnTC 
AAT XGT jCMGCAQCnrGGTGGAGACGAAGCGGCACTTTTCGCKKjAG^ 

3CGG ^GCeCAAGGTTGGCGCTTTGAAGT^ 
AAC TGGi rGCTATGGniX^GCirCAGTCTGTATACTC 
OUjcGTG rrCCTGTCACAGAM 

A AGAGbTIGAATACGAGATTOATCCAA 
TGGtrGGA CAGAACGTCMTAAGGTTGCGACJIXjCC^ 
GAp KTQC AGGi&GMCGTACXCAGCAGAAGAACC^ 
GGT ]ACCACTTICCn^GATTXiCTCAGG 
AC? }TTCAGAAfcGGATCXGAACrTATAACTTC^

s 



•o 



2.CFBM 



CC TCC/ AAAACTAGATACGATTTTCTCTGGTAAATTGGACGAAGTTGTGGATC 

2.CFE92a "homologue of SEQ. ID NO. 88"
AT GGClfrACACTCTTA^^ 

AC ACT1 ACGGAATTGAATACCAAGACGAGATTATCATCGTCGATGCrGGGATTAAATTCCCAGAAG 
CI TGC7 IXIGTATCGACTATOTCATTCC^ 

71 ITA A rCACACACGGACACGAGGACXJACATTGGTGGGATTCCGTTCCT^ 
TA TTTA rGCTGGACCGCTTGCXnTGGCITrGATCCGTGGGAAACTCGM 
0C CAA; CiTTACGAAATCAACCACAACACCGAGTTGACC^ 
CG ACTC \CTCTATTCCAGAGCCTTTGGGGATTO 
(5A CtTT \AGTTCGACTTTACTTCAGTTGGA 
AAGGOC TCCTR^TCTCCTGTCTCA 

CG TTG^S rCAGTCCATTATGAAGATTATCCAAGGTATTGAAG^ 

J$ \JCT PCCGTCTCCAGCAGGCAACAGAAGCTGCTGTO 

TjT 7TA? /GAAAAGGCCATTGTCAACGGAATCGATCTTGGCTA^ 

OA QCCf AATGAAATCAAAGATTATCCTGCAGGAGAAGTTCTTATCC^ 

CT VTb(S lAGCPCTCTCTCGTATCGCCAACGGAACCCACGGTCAAGTACAAW 

TA PC™ rrCTTpTAGTCCCATCCCTGGAAACACTACTAGCGTCAACAAGCT 

CrCOTO TCGA vgttatccacggtaaagtgmcaatatccatacatx^ 

C rCATbcTCCGCTTGATTAAGCCAAAATACTrcATGCCT^ 
rCCA XCTGGACTAGCAGTGGATACTCGTGrrGTGAAGGACAATATCm 
GT 3C7T! }<XCTTACTGCTGACTtlAGCTCGTATCGC^ 
AA \TCG rATGQGTGAAATTGGCGCAGCTGTCCTCAAAGATOGTCGCGA^ 

CP IGCA itcgcaactgttgacttcaaatcg^^ 

TWTCT7 CAXGAOAGAGTCTGCMGACTrGATrX^ 
GC. VCTQt lAAAATAAGGATGCTAGCGTGCAATCTGTCAA" 
TC" ATGi LAAATACCGAACXITGAACCGATCATCATGCCGy 
CC^CCA X^CCACCACTOA 



AAAAAl 
ADTCC/ 



GA 



GAilAAG 



ATCttfCT^GC^AACAAAAAAGAAAAAATCAACAGTT^ . .__„_ _ _ . „ . _ . 

AA<pGCC IAOACXJATTGAAAAATATCTAGGC^GAA^ 
TQJ AGM ATCC^GTaTOTCCGTC^ 

jCCCTCTTATCAATGACTTGAAAAAAGAAGCT 
ACCtCGGi kCCGTGAAGGAGAAGCGAmCTTGGCATTTGGCO^ 
c^y xxxr GTGGTCTTCAATGAAXTC^^ 
GAi AtGC f ACTTGGT^ATGCCCAACAAGCTCGTCCk^ 
CT/ TTTT STGGjVAGAAGGTCAAGAAGGGCrTOTCAGC^ 
GAT TGAC CGTuAAAATOaAATCAaTGCCTTCCAGCCAGAAG 
AA( fGGA (CCaAACAATTK^TGCTTCCTTCT 
AAC GAA( iTCAAGGAAGTCTTGTCTCGTCTGAOT^ 
GA( 1CGT7 AGCGCAATGCTSXHTTACOT 
ATT ICCG rACTtGAAAAACCATGATGGTTOCCCAACAGCTCT 
TCAJaGGI TTGATTAOnttTATO^ 

itac^atcgttttggtagcmgtattc 
g^ tcccc^tc aggctattx:gtccgt^ 

GG/lWAC IGAICAGCICAAGCTAm 

GTT ntQ VTAC(^TCGCTOTtAAATf GTCTCAAAAA 

GTT KJAt 3GTtATCTTGCCATTTAf AATGATTCTGAC^ 

atgitsgtsaaaUggtcaat^^ 

aac \cw atta^aacctta^ckf aaaatggggttgg acgtccatc 

CAT PCA<? UAGfymTTATtiTTCGCCTGGCAGCCAA^ 

AaT VAGC rCATGGTTGAATATTTCCCAGATATCGT 

TGG KTGA raiC^AAGTGGGAA>WGACk:AGTGGCGACGGGTCATTC
CC>

• ACC^ 



cqic 

Vt GCl 



a AA*rrA a r«Y;TAATC(jCCTATTCTATGGTTGCAATCGCTATCCAGAATQTGAATTT 



Batcgcgatcatacccaagcaatcgtgaaagaga^^ 

rf^ SSSaAG^^ 

<5CrCpACX^c|A0CACCA<^CCACTGA 

2.CFE95 "homologue of SEQ. ID NO. 91"

" TT(tCATCAQTQCTGGAATTGTGACATTTTI/»;i aau i 1 i Auim»y/\A» iia*\jvsv<~* «. 
AtlmAtAGA&G^GCAAATrACAGGCC^GCAGATGCATGAGGATGTC 



"homologue of SEQ. ID NO. 91"

>M STOfATrrdCATCAGTGCTGGAATTGTC 

IB 3GAC TOCTaIsaASGGGAGGT^ 
I? £S SSrAGCAATAAT^^ 

.*» SSaSc^aagg^atgggtoatgtgggaag™ 

lb TCT, TCGC^CAC^GGAATGQACTCTCrrGATTATCGGAATTGTGTATC 
TO atGCAAG^^^ 

AC :aTT TG AG CTTG G GG G ATTGTCTGGTAAAGG AAATCCTTG GAGCG AGTGGAAGGTIX3ACTTCTTCTT 
xf< [GGQlGnSGTCTTCT^ 

aci^cqactga; 

"homologue of SEQ. ID NO. 92"

at SsAtoSKmrcAcna^^ 

C, u kCAA ^ACTOAGCGTATTCTTTACTACACnXjGTAAAATCCACAAAATC 
Gu "ACA; [ATGGACTCjQATGOAGCAAGAGCAAGAACGTGGTATt^ 
TO ATG< IAACAACCACCGCGTAAAC^TGATCGACACACCAG^^ 

ac< »rra ctt(X}t^attggatggtgcggttacxjgttcttga 

AAaCAO TTGGCOTCAAGCAACTOAGTACGGAGTTCCACGTATCGTATTTGCC^CAAAATGQACAAAAT 
CfiC 1TCG' GACTilXXTTrACTCTGTAAXjCACACrrrCACGATCaTCTTCAAGCAAA 
TXSt ICAAT CGOTlrCTGAAGATGACrrCCGTCGTATCATTGACrrGATCA^ 
TA> CGA( JCTIOMAOGOAlATQCTiaA^ 

CGI GAAj lAATTIGATrGAAGCAGTTGCTGAAACTGACG^AGAATTCATGATGA 
GX JVTd; CTAACOAAQAATTGAAAGCTGGTATCCGTAAAGCGACTATCAACG^ 
^ GTGC TTCA<KXnTCAAAAAaVAAGGTCTTCAA 

m ACATtaabOCAATOOiACWI^ 

LGCCATTrGCAGCTCTTGCCTTC^GATCA 
\GrrC^GGTGTTGTIt^TCAGGrrCATAC»TATTGAATACnTCT 

ATOdrrCAAATGCACGCyrAACAGCOiJT^ 

J xtggJtttgaaagatactacaactggtgactcattgacagatgaaaaagct 

ftiCAAJl CAAwTTCCAGAAOCAQTTATOCAATTQAKKJTTGAGCCAAAA 
GGG1 ATtGCCCTTCAAAAATTGGCtGAAGAAaATCCAACATT^ 

CAW AGnOAAGCGAA<MTAGaTGCTCeTCAA<yrATmA<X» 
CGC OGAI TCTTdAAACGTGAGTCTGGTGGTAAAGGTCAATTCGGTO 
AOC AAGV AGGmAAGGATTCGAATTCGAAAACGCAATCGTC^GTGGTCTC 
AGC 3dn aAAV&AGGTITWTAGAAT^^ 

AAA C3GTA AGCTnTATOAICGTTCATATCACGAtGTCGACTCATCTGAAA 

act nrcci :ttaaagaagctgctKaatcagcacaaccagctatccttga^ 
act jrrc ^gaaoaaaaccttggtgatgttatgggtcacxjtaacicctcgtcgtggacqtgtagatgota 

TGG \AGC ACACGGTAACAGC^AAATCGTTCGTGCTrACX)TTCCAOTGCTQAAAltJTTC 
AGT mi XjTTGTGCATCTCAAGGACGTGGTACATTCATGATGGTATITGAC^CTACGMGATC 



t CTT 

aKc 



IAGjLI
* Akfr cacJtacaagagaaattattaag



"homologue of SEQ. ID NO. 93"

AT IGCAkATT ACaATATC^ 

cc " i toc jttctpgttgg atcacaacaggtcctaaaacaa a agaact 
ac Igal acctAagactgtttqtctcaactctg^ 
TG }GAC ctggtgatgaagtcatcgttcxagccatgacctatacggcttcatg 
gg sagc aaggcctgtcatggtggatatccaaggagatacgtttgaci^tggact 
cc tatc vctoagaaaactaaggtgattatcccagtagagctwcagc^ 

TC (TCG lAGlTOraOAOAAAAA^ 

TA (TGT< JATTOTCTCTGATaGT gtcccacgctttgggatctac • 

j A {px TAOTCCTTCTCATTC^ 
aa wjcc \ATCCAGTT3ATTGATGACGAAGAGATGTAC 
CTi^GGATGCgCCTTGCCAAGATGCAACTXKjGGTC 
CA \CAt jACCQATATCATGGCTTCACTKjGTrrGGTACAATT 
l N or WVGG \(^TtGTGGACCGCTATGATAGTGGTTTTGCAGGTTCT 
TO VAAC rGTCGAATCTTC^CGCCACCTCTACAT^ 

cr< :ato ;rc wagaattggctaaagcaggaattgcaagt^ 

CAi SCCT iTAAGAATCTTGGATTTOATATOACGAACTATC 

ac* xno ctcttcatactaaattaagcgatgaagaagtaqact^ 

CTC i AAA UGTOCTAACTTTATCIAAAA 



■5 



I* 



1 



"homologue of SEQ. ID NO. 95"
ATOITTtATACTTATITGCOTOO 

Ail' 'ACT( JATAAAATTCiCTAATCAAGATGAAAATrATATTTTAGT^ 
CTC tTTTi TATGGCCTTTGCOACCAAOCCAAAACAGTTCATCTTTATC 
^ * CrtfrATOnTGGTTGGTC 

^ CCATCA>lATATCCTATCAACGTTCIX^AAAAAAOT^ 

* CJ^< TCAi acgatgtcaaggggggcgcagcactgattgcgaaaato 
.in acc \taoj xxtggtcc^tgactttgaagg^ 
v£ caj x&ij ,f atcitcagatatcaag aaaatg a atcutgaaggcatt^ 
cac taatfj xxaajcgtctggacgmgaaacgaaacaatggcacaat 
g<p ttaiccck:atccctgccctcatcct^^ 
c/j gcm catctggaacccagataagaaaagagaadaacttgcactcga 

GA f . ,j t! 

f. : . . ■■ ■ " 

2.CFE101 "homologue of SEQ. ID NO. 97"
ATC ACC^CGAATTITTACATTTrGAAAAAA 
CAC CtCC riTGACAGAAGAAGAATTGGAATCnATCM 
TAG AGA1 ATtHlATCTOCXXrrraGCTCA 
CAA AAfiC fAAT^CCTCCAACGTOAAAGTAAATCr^ 
OTI 3QAJ AATGCACAA<XAGTCX5CCTACnCAAATCCTACT^ 
AQ1 TGOI rACAACTGATQGTTTTCTCTATC 

cqa nrrc ^gaaagctatgatatogaagctcttgtcaac^^ 

TAG aTA! itCTGTCTATrCTGATQAAGTTTACGACATOT 



^ TGArTTTltAAT^GTTOAGGGAAlX^TO 
t£ TCTrTfcA]Tn^ 

AAA ATGfc TOAGTCCTAGGCCAAAACCUCCCrGATAGCT 

AAC IXjG/ AGCOjmGCCCAT^ 

AAt CAG^ AATCjaTOCAGAAOTGATTCTTC 

AAA GCTt GAG^CCAC^CGACfcACCACTGA . 

"homologue of SEQ. ID NO. 98"
ATG 3AAAtTrrCATTATTAACAGATGITGGTCAGA^ 

TCA ITAG \GCTpG ACGTACCATGATTATTTTAGCTG AroGGATGGGAGGTCATCGCGGAGGGAATATCGC



© 



TA STGA AATGGCGGTCACAGACqTCGGI^ 

0G r<SA/ TGGlJtCGCCCATTACCTAGAAATTGAAAATCAAAAGATTC^CCAGCTTGGTCA 
AC KGM GCAtGGGAACTACTTTGGAAGTCCTTGCTATTATTGATAAT^ 
OA ITCC CGTATCGGC1TGATTCGTGG AGAAGAATACCATCAG 

AA TVGt rCAACKSCTGGTCAATTGACACCAGAAGAGGCAGA A^CATCCGCAAAAAAATATTATCACCC 
AG rCTArTGGCCAAAAAGATG^ 

TAGTCACGGCTTGACCAACATGATTTCAGGCAGTC^ 
AGCAGATAAAACGGAGACACTTGTTC^ 
2CdrrpTTTGTATGAACGAGGAGGATGAAGAAC^ 

2.CFE103 "homologue of SEQ. ID NO. 99"

CTTGd"

at rep

CG
ATfcACG^TACAGATGAAGAAT^^ 

AG 



3 QTTtTTGTATGA ATTGCGAGATCGTTTGAAG AGAAATCAGTT^ 
Tp JTCA' TTCCATTGGCGGGGATCGTATGCTCTTC^ 
GTPCGGTTTATCGGTCTO 

LTTTGCAACTAGATACTGGGGGAAGGGTTTCrTACCCTGTTCTGAATG 
TT$AAAl\TCGTGAAGTTAAOATrrTCAGAGCACT 

JTTCCCTTTGAACGTTTTCGTGGAGACGGGCT^ 
rfcGTAGTACTGCCTATAACAAGTCT^ 
Art AAQ jGAGaTTGCCAGCCTTaATAATCGTGTCTATCGAACATTGGGCTCTTCC^ 
AG 3ATA^GATTGAACTTArTCCAAC^ 

TIX XZGTt ATaTTGAGCGTATTGAGTaTCAAATCGaCCATCATAA 
TA< « AG' TTCTGGAACCGTOTTAAGGATOCX?^ 
CkCCACJGA 



V V AAA" 




Alt i 

AC^I 
GA'TAC 
AA<5' 
TQJ1G 



roc c 



ACirnx 
cnccAi 



"homologue of SEQ. ID NO. 100"
UVGAAArTAAATTTTCATCAGATGCCCGTTCAGCC^ 
vUACTAACCTTGGOACCAAAAGGTCGCA^ 
TGACKGTOTGACC^TTGCCAAAGAAATCGAATTC . 
[<{rrATCAGAAGTAGCTTeTAAAACCA^ 
,_ r _ ---.AGCTATCOTCCOTGAAGGAATOUAAACGTCACAGCAG 
tUC rGATT XJAAACAGCAGTrGCCGCAGGAGTCGAAGCTT^^ 
AGi AGtt rATCGCTCAAGTTGCAGCCGTATCTTCTCGTTCTGAAAAAGTrc 
ATX GAii LAAOirTGGCAAAGACGGTGTCATCACCATXX5AAGAGTCACGTGGTATC 
GTqGTAX fMG6AATGCAGTTTGACCGTt3GTTACGTTTCACAGTA 

TGj/ ccttcaaaatccgtacattttgattacac^ 
^ gaaagcattctccaaaGcaatcgkx^ctctto 

iA CTChnTGTTTTGAACAAGATTCG 
TQ^CCGI CGCAAAGCCATGCTTGAAGATATCGCX)ATCTTAACAGGGOT 

TtC AGTTPAMGATCKXJACAATTGAAGCrCTTGGTCAAGCAGCGAGAGTC 
AQCf ACGK riTATTGTAGAAGGTGCAGGAAAlCCTGAAGCGATrTCTGACCGTGTTX3 

T<$& AACTACAACTTCTGAATTTGACCGTGAAAAATTCCAA 
TGljAGCC GTTATTAAGGTrGGAGOCGCAACTGAAACTtlAGTTGA 
— ■* CGjfrACTCGTGCAGCra^ 

AGAAWXJtTCGTCAAATTGCTC^ 

AAAUtt iCTOAGCTKKJtATAGG Am 
ATC \TX& \TOCAGTTAAAGTGaGTCGTTCAGC^ 

CAA CA|Gl< AGCAGTCGTAGCCAATAAACC^GAACCAGTAGCGC(^GCTCCAGCAATC 
TGG SCGf GaTGATGCTCGAGCACCACCACCACCACCACTGA 

"homologue of SEQ. ID NO. 101"
ATG mAAGATTGAAAOCGTATTAGATATm 
GTC KTTApCACrACAACTACAGCAMGtTATITITC 
AQACAC 



CACT TTTTJTTTTGCAAAAGGCGCTGCCT^ 
TAGfTTq^^



ACtcATbAGTTTGATTGCCATGGAGTTCTATGGT^^ 

OGWCTAAGGOTAAGACAaCaGCAACCTATTTCGCCTATAACATCTTATCTCAAGGGCATAGA 
^ TGTTGTX GACOATGAAGACAACTCTTGATGGCGAGACTTTCmMGTCAGCGTTGACAACCCCTGAGAG 
J TATTGAl 5CTCT^GAGATGATGAATCAGGCTGTGCAAAATGaCCGTAC(X2ACCT^TCATGGAA 
. < AGTCAA jCCTATCTAGTCCATCGAGTCTATGGACTGACCTTTGATO^ 

TQACti' ^TCGGCCCGATrGAACACCCTAGCTTTGAAGACTAriTCrACCACAAGCGTCTCrTGATGGAA 

AAtAco :gagcagtcatcattmcagtgacatggaccacttctcagtctto 

Mp/CCAfOATTTCTATGGTAGCCAATTTOA 

^iGGXVAACtCGCTGGAGATTATGATATCCMCTCATTGGCAACTTCAAGCAwAGAA 
^ 39? AC ' TG ^ 0TCTCC GT^QO AG ^ G, rcTT0AGQAC^ 
■k TC^OG-CGTATCGAAGTCCTCACTC^GAAAAATGGAGCCAAGGTCT^ 

GAT ACT! ^GAAAAMCTCATCAATGTGGTroAAACTCATCAAACCGGAAAGATTC 
iCAGJAAAlJMGGGAGAAAGTCGTCGTAAGGACTTTGGCCTCCTCCTCAAt 

pTCT TCTGACTGCTGATGACCCTAACTATGAAGACCCAATGGCCATTGCAGATGAAATTAGTAGCTA 
CVjCAA 1 ' GATCXHGTTGAaAAGATTGCGGATCGC 

TCACGA. lTTAGATGCAGTTATTATTGCGGGTaaGGGAGCCGAT^ 

™ VipC 'ACCCAGGAGATaCAGCCGTCGCAGAAAaTTATTTACTCGAGCACCACCACCA^ 
"homologue of SEQ. ID NO. 102"

^!1 a IH!^Jccgcaagatit^ 

CMATG XTACCTAGCCAAAGACTTAATCTTAGATGGGGAAaAAGTG 
ACJACQ GACGGACCCGATAGCTGTAGCTCGTTTrCAGCGTCAAaCXUGAGCTATGGCA 
TCOTOAI ATCQTTCGGATAACAGATATTGtf TGAGGAAGACGGTCAA 
GWGGAITACACCTCAAACGCTATATCAAGGAA^ 

3ACi M^ATTCTCTr.GGCTATGCGClTaGCCCATACTCXjAGGAATrGTTCACAGGGACTro 
TATCCTTTtGACACCAGATGGGACTGCCAAGGTCACAGACTITQGGATTGCrGTA 

AG<3< IGACTOTGCAGAGTGATATCTATGCCATGGGGATTATTTTCTAtGAGATOT^ 
fcmTATT^TOGCAGCTTCrCTTATrrcGATACTATCC^ 

pCAGAaGCTAGrGAAAAGGTGGAA^ . 
^GMC^AACaAAAATTMTtnTO^ 
3GCC GGAAATOTCTGATGTTATOKXMAATTAAMGAGAAAAM 

GAA(ICTAATATAGAA<nTaTAOAAGTOACCUCAGCGCCTG^ . 
CAA> QTCCTAGAGCAGGTGAAAAGGTAGACCTAAATAAGACTA^ 

"homologue of SEQ. ID NO. 103"

TTATGGASA



iCAATOT 



^^^^ TOT0A ^ CAA ^QTAGOTAGACAACCTCCTAGCTG^ 

err xrrti acaqtgttttctcccagttcgata att* ty? a r a iv * tv-> a «^wi-r-^C^«\i ^ ; . tv__ „r 

ACT TAAG 3TCG, 



:AC^AACTAGGTGGTCMGTTGCTACTpAAATAGAMTCCTO
GAllGAcbAGTTTGTCOT 

GTMGT :htatccttcgactgg 

AC ZGAT -TTG^GGCTGCTGAAAGTCCTTTATTMCAGATTTT^ 

CX3^CJGATGTGACCCTAAGTTATGGTAAGCATGTCATCACTCATOATTTACTC 
CA 3GTC rrGCrGGCCTACGCATGTCTAGCTTTCTCAAAGGT^ 

Oi \CTT CTGAGAAGGACrrGGTTACATTTCTAGAAGAAAATCGGGAAAAATC 
N AA \CCT rGlTACCAGAACGCTTGGCCGAATTTTTTO 

T& KAAA SGAACGAGA AC^CTTGTCCAGTCGATTAAAGAACTTAAAAT^ 
TGCi IAAGTC£TTTCTTaCGAAGGGTGGAGTCACT 
* A A XtfTG 3TACCTGGCCTCCACTITGCAGGCGAAGTTATGGATATC 

Tpi LCtT< TGCCCTCTGTACCGGCTGGGTGGCGCkJAAGTCTGCATTATGATCTCGAGCACCA 
- GjUCTGA 

2.CFE108 "homologue of SEQ. ID NO. 104"

AT( SCTgAaATOGGAAGACTTGCCTGTC 
Cp AAA iGAAPGGTC 

AC' TCTC CCATCTITCTGATCTTGAGCATTTGGATCAAGTrG^ 
A& ^qp? rGTGACCCAGTACAACCGTCGGTTCAAGATTTGGAAGTTCCGTAO^ 
AA- WVAA GGAAGTCTGGT GACTTCTGCTAACGATAGCCGTAm 
TCC :GTTI GGACGMCTGCCItLAATTOTTGAATGTCCTTAAAGCn^ 
GA IGTG :CACGTTATACAOAGttlGTATAGCCCT^ 
H CCT 'CtCC AGCCAGCATCAaCTACAAGGATGAGGaCaCCATCAT^ 
CA< >TTG> lTCAGGCCTATGTGGAGCATGTTCTTCCT^ 
OT, TAGT TTCTTTGG<KIACATGAAA ATCATGTTTC^AA 
AC( !ACQ/ ICGACTGA 

2.CFE109 "homologue of SEQ. ID NO. 105"
AT(IACtA0TCCACTATTAGAATCTAOAC(jCCAAC'rC(XjTAA^ 
Gil CGGil 'AOGQ^TCTCGAAACTGCTTCTCGTTTCG 
CTI CCAC CCnmGATAGACCTCOrmmSOTGmAAGCtAAAAAGC^ 
CTC AGO .mAAAAGCAGGTTGGACCATTGAACGCrTAACGCTCGTCG 
ACT Cm GAAATCACTTCATTTGACACTCCTCAGC^ 
H» ACI TCTC CGATCAAAAATtnXKJCCGTTTTATCAATGGACTGCTGAGCC^ 
CO^GCA<iCACCACCACCACCACTGA 

"homologue of SEQ. ID NO. 107"
ATC AG^AaCGTGTAACGATTATTOATGTAAAAGACTATO 

^ C^ATCAGGAAAAGGGAAAATCGCTTTCTTACAATTGCGTQA 
TOT 0AC1 TTTAjVACCAAACTTTGT^AAAAATTTGGTGAAGAAGTGGQACTTC^GA^ 
AA/ ,CGG TGAGCCAAOAAACGTCTGTTTATGTGaCAGGTATTGTC^ 
GCI ATOA GTT^GACATCACAGAWTGGAAGTGATCGGTGAATCrCAAGA^A 
AA< ACGC lAACAGACrrTTrGATXKJATAACCGTCACTTGTGGC^^ 
. GCA AATC CGTAACGCTATTATCrATOTAACTTATGAOTtCTTTOACAA 

GdC CAA1 ICTTT^GGAAATGCGGCAGAAGATTCTACAGAACTCTTTGAAACTO 
AGC CTAC TTGAQCCAATCAGGTXJAGCnTACCTAGAA^ 
•mnl-njOC ^GTnTCCGTCCTGAAAAATCAAAAACACGCOT 
'WrCATACTTOAC^CATCATtUGTCGaTGACTTa^ 

> Trt^bG^TTCAAAOTTAT^CITACGATCAAG^ 

STGTGCCMCATTTGTCATGAACTATCCAGCAGCCAl^ 



"N- 



3. 



CCTtCGCAC 
ACTCQAGG
"homologue of SEQ. ID NO. 108"
ATOTCT&AAAi^TTA^ 

CG< JTTA UUATQCTAACTTCTITATCAA.CAAGGGAGAAACTTTCTC 
GA< UAC \ACrATTOGTCGTGCrATCATCGGTC 

Ou iAXQ otaAtggtaagaaatcgcgtgaacaagctgcggaattgatiwtcgm 

s ac r ^txjcrocaagtttgaatgaacgtgcgactgttgattatatt^ 
iiaS ttaaGgatgaagaagaacgtaaagagaaagttcaaagt^ 

to otcatggaaccagactttgttattgcagatgagccaa 

AG' CTTC AACTTXjCTCAAAAAATTCCAAAAAGAGCTCGGCTTGAC^ 
C6( iTTCT TCGCTTTATTTCAGATCGTATCGCaGTTATTTACAAGGGTGT^ 
GAj iGaA rTGmAACAATCCAATTCACGCATATACrCAAGCCTTC 
rrCGAAQBTAAGAAGGTGTTGAAGGTTTACGACCCAAGTC 



I^S AG' 



A^'U 

At< tat3gtagaaatccgtccaggtcactatgtttgggc^ 

GG; ^CTAj lACCtCG AGCACCACC ACCACCACC ACTGA 




ATC rAjL&T^OT 

GCOCTCC TTAGTCCTGAGAGAC-AAGCTAGTCre 
OW JCTGJ LGTGAGAATGTGGTGGGAACTtTTTCrcTGCCTTATT 
TCaGGAj TACACCGTTCCCTATCTGACAGAAGAACC^ 
AT( Ato AGCGTGCAGGTGGTTiTACTGG^^ 
ATC AAOj TCC^TCCTAAACTAGCGCAAGAGAAGAITO 
CO ATC> AGOCTArCCTTCTATXXJTTAAACGTGGGG 
A<t( ICGA LCCA^CTTTCTCGTTGt^ATATTCATGTC 
M< ACO LTGCTGGAAGCCrroAAACCAGTCrrrAGAAG 
C7C TCCtf ACTAteGTOACTGATTCfCrre 

A*J lGGA X^A^ACGAGAQATTGCGQAGAAAATTGCGTIXSGCrAG 

ccc ago gqtactc^taataaaggaAtttttaatggtattgatc 

GQC GTG( CATCGAAGCTGGGGCCCATGCCTTTCCC^ 
O AC GCIX GACCTTGAAAGAGAAGAATTGGTCGGrGA 
TGC C1?G7 ATCQfiKXTCAACCCACGTGTAGCTC^ 
tac ccc5 GAT^TCGTGTCCATCGGTCTTGCT^ 
aTCCAGC AAGGCGACaTGAAACTACAGGCCAAATCCCTAG<^^ 

ttc ctcc :ctagtagagcgcctcatct^ 
aa/tttaagatcagc<jgccgcactcgagcacc^ 

2CFE114 



ATGCGA/TTi 
AAtTCG/ra 



TTACJiITC/StAGAA^ 

}MTTTTVAGATAtTQTGGTTCGTOATTACQ 
TTTlK^GTXTTAAGAGrrrAGAACLAGCGTTTC 
TAT IGAT rGCTQAGGATACAG(nX>AGAGAGTGAAACAGGCGGCGCATG 
ATC AAGC AGAGGAAGAJGCGCAACGCTTGTTGGAAGAAGCTAAATATAAGGCAA^ 
AAC CAA( TGATAATGCTAAGAAAGTCGCTGTTGAAAWGAAGAATTGAAGAA 
ACC \ACC TCTCAAATCTACAATTGAGAGTCAGTTGGCTATTGTro 

cgt 2CA4 ^gctacttAtxhtcAaaccaotgatgaac^ 

AAC CGA1 rCCAGCTCCAATTGAAGAAGAACCAATTCATATO 

M^CGTATTGAGGTAGCCGATAAAGAATTGTCTG 

ma^ct4^ 

VT^A \GAAjCCAOAAGC^ 

"homologue of SEQ. ID NO. 111"
t^AKTCGXCAACTAATTTAfrrAAtGGT^ 

ACdtAAGCGAAACGAGGGGAGATTAGAGGCGCTAGAAGAAAGAAAAGAAGAACTATACA 
OTA^TGATGAAGTAGAAGCTGTAAAAAATATGCACTTGAtTGG 



AOAATTACAAi 

A<Jrtr r -~ 

■ TGt,
CchGAklGGAATCAAAAATGGGTCGATm^

}CA6AAGGCTATAACCATTCATTTCGTTTTCTCAAGGCCAGTCATCAAAT^ 
™ rM ^GAmAAOAAGATATTGCGGCMTTCGCAATGirmGGCAGACTTAGAQAAGCAAGAATCT 
AA WAT \GTGGTCGTGnCTTCATGCmGGAmATnGAGGAACnCAGCATAGAGTTGCTGAAAApC 
AG U.CA 3TATGGTCAAGCCnGGATGAAATTGAAAAACAATTAGAAAATATCCAATCTGAATTTTC^ 
IF 'GTA> k cCTTGAATTCATCGGGTGACCCTGTGG AAGCCGC AGTG ATTITGGATAATaCAGAAAATCACA 
TTi TGCH iCTTA'AGTCATATTGTGGATCGTGTTCGAGCCTrGGTC 

CA- LTTAi 'AGGAmGaAAGCCGGmTCGTAAACTAATTGATGCTAATTATCATrrrGTTGAAACGGATAT 
TCi la!g1 SCGmCCACTTGCTITATGAAGCATTCAAGAAAAACCAAGAGAATATTCGTCAGTrGGAATTG 

GA vSt XXX^TATOAOAATOGACAa^ 
AA OTG nXJCEEAGAAAQTAGTGGAAAATCTACnTGCAACTC 
GA STAA rAcHrATTGGGAGAAGATATTGCACGTTTGAACAAGACCTATnACTTCCTC 
AG EAT }TTCGTCGTATTCAGACAGAATTAGAGAGTTTTQAGGCAGCrAT^ 
AA< JaAGAACCSaaCCCAAGCTTATTCAGTTCTTGAAGAAAA 

Afc LTTa iAGATGAGCAAATTTCAGTTAGTGAGCGCCTGAC5aCAAATTGAGaAAGaTGATaTTaATQCAC 
GTC ;AA A ^GGCCAATGTTTATGTCAATCGTGTCCATACTATCAAGCGATAC\TGGAAAAACGCAATCTGCC 

ag< jtat x£acaaactttcttgaagttattctttacggcaagcaata^ 
tax !aad laaaaatgattaacattgaatctgttacccgagtixhttj^ 
tn aga/ acggaaacttataatattgtacaatatgcaactttgaca 

CG< rTAtC IGCTtlAmGATGAACGCATTCAAGAAGCATTTAACGAAGCTTTAGATAU 1 llGAAAAAGAAT 
TTC ATT> TCACGCTTC^TTTGACAAGATTTCTC 

CT1 TGTI \CCTtATATGAGAAAACACGTGAAACGATTCGTTTTGCGaCCGCACrc 



CACCACGA 



3 



II 



2.CFE116 "homologue of SEQ. ID NO. 112"
ATCfCfrriTATdTTATA^ 

AAi lTCT( AACTACAGGAATCGAGGTAGAGGGtX3TCGMTCACCAGCTGCTGGTCTCT 
CGC ITOA< SGIXHTO lXnTGCMAAGATO 
GAJ ,Ga6i X3TGAQATGGTTTGTGGTC 

GAC IGTCX iTATOGCTOATAACTACAAA ATCAAAAAAGGAAAM^ 
TCI GTTC ^CTTQGTGAATTGGGLIATTTCTO 

TtG CQTC AAGATGCCGTGCGAGGTGAGGAAGTCnTrrGTTAOCTAGA 
TTI CCAT ^CAroAAACXGTGCA^ 

TG/ CAA( fGCAQTCAACTTTA AAGAATTTACTCTAACAGAA^ 
GTC AGd/ TTGAGACAGAGAAGGCGCCTTACTATGCAGCTCGTATCTTC 
GTC CACfc ATGQtt^AAAACCTTCTCATGAACGAAGGAATCCGTCCC^ 
CA/ CTA4C ATOCTGCTCTArrTtGGTCAACCAATG 
TCC GTGT QCaibAAGOCKXJTGCTGG^ 
GA/ TGA( ICTAGTCATCACTCTCGCAGACAAGCC 
QAJ ATC1 CTGAAAMTCTAaTCGTCTTGTCCTTGAAG 



AAC TGG1 CGOC^AACXJTTCGTTCTCAGTCA 
AAI GAAC fCCCTttjATGCGGGAGCTAGOT 
GTT rCAO CGGGTGAGCmATACTTCAGATGTAGAAGTTrC^ 
CdC AACH GAGCTGTCTTATGCntJATGTAGAAGACGTCTTCCOT 
CAC ACAC CTTT'ACAGTGAGAGtCCCACGTO 
AAI TGCT XjTAtCTATGGTTATC^CCGCTTOCCAACTA 
TTG \CA€ CCACACAAAAACTCCGCC<JTCAAGTTCGTACTATTGCTG 
TCA XTA rACTCTAACAACTarTOAAAAAGCAGTTC 
CAT jTGG CCAATGAiHGTGGATCGT^^ 

CCT VCAA CGTGpCTX^TAAGA^TAAAAACTTGGCCC TTrAC GAGATTGGA 

TAA TCCA \AAGMGMCTTCCAAATGAAATCAACAGTm 

AAlAGATT rOCAMCAGCAGCAGntXAGTT^^ 

TCG rm< JGACTCGAA<nAACCTATAC^ 

GTC imp 3ACT<DGGTGACCAAGTTCTTCGTTTCCT 

TAT X:CA< jAAAteTATCTGGCTGAGCTTMCCn^ 

TTflpTAGfAATcUc^^ .
\ AT CAAC AAGTTGTAGAT0CTATCCAAGCTGCCGGCGTCAAACQTTrGLA.CAOATATCAAACnx^rTTGA
CT rCTC ^GGTGAGAAATTGGGACTTGGTATGAAGTCAATGGCTTATAGCTTGACCrTCCAAAATCCAaAA 
Uf^ 0/ TAGC TTAACGGACGAAGAAGrcGCACGCTATATGGAAAAAATCCAAGCATCGCnrCGA^ 
• M TGCfi GAAaTGCGTCTCGAGCACCACCACCACCACCACTGA 

2.CFE117 "homologue of SEQ. ID NO. 113"
AT 3tTAbAAAA(^TATTAAAAAAGTCCrCGTTK^ 

GG TGCT JAATTMCTAAAQACTATGCAGOAAAAAATCCAATCTTAGTTGGGATTTTAAAAG<j 
<tt ITTA] 'GGCTOAATTGGTX^AACATATTGATACA^ 

' O CATGOT aGAA^GCAAGTAGTGOTGTTATCAATATTAAACAAGATXJTGACTCA^ 

5? <*I^^ATTtOTAGAAGATATCATTOATACtt^ 

AA A.GAC AAGC^GCTT^GTTAAAATTGCAACXnTXnTGGATAAACGAGi^ 

AG GCaC ACTA|TACCTGGTTTACTATCCCAAATGAGTTrGTAQTAGGTrATGGm 

Wv TTAWXJ MTdrnanTATATTGGAGXATTCAMGAGGAAO

SEQUENCE LISTING



<110> Dougherty, Thomas J.
Pucci, Michael J.
Dougherty, Brian A.
Davison, Daniel B.
Bruccoleri, Robert E.
Thanassi, Jane A.

<120> NOVEL BACTERIAL GENES AND PROTEINS THAT ARE ESSENTIAL
FOR CELL VIABILITY AND THEIR USES

<130> 30436.44USU1

<140> Not yet known
<141> 2000-12-30

<160> 226

<170> PatentIn Ver. 2.0

<210> 1
<211> 708
<212> DNA
<213> Streptococcus pneumoniae

<400> 1
atgatttatg caggaattct tgccggtgga actggcacac gcatgggaat cagtaacttg 60
ccaaaacaat ttttagagct aggtgatcga cctattttga ttcatacaat tgaaaaattt 120
gtcttggaac caagtattga aaaaattgta gttggggttc atggagactg ggttttacat 180
gcagaagatc ttgtagataa atatcttcct cttcataagg aacgtattat cattacaaag 240
ggtggtgctg accgcaatac aagtattgag aacatcattg aagccattga tgcttatcgc 300
ccgcttactc cagaggatat cgttgttacc cacgattctg ttcgtccatt tattacgctt 360
cgcatgattc aagacagtat caaacttgct caaaatcatg acgcagtgga tacagtagta 420
gaagcagtgg atactatcgt tgaaagtacc aatggtcaat tcattacagg tattccaaat 480
cgtgctcacc tctatcaggg acaaacacct caaacattcc gttgcaagga cttcatggac 540
ctttatggat ctctttctga tgaagagaag gaaatcttga cagatgcatg taaaatcttt 600
gtgatcaaag gaaaagatgt agccttggcc aaaggcgaat actcaaatct gaagattaca 660
accgtaacag atttgaagat tgcaaaaagt atgattgaga aagactag 708
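
As a consistency check on an entry such as SEQ ID NO: 1, the declared <211> length can be compared with the residues actually listed under <400>. The following is a minimal sketch, assuming the layout used in this listing (groups of lowercase bases followed by a running position count at the end of each row); the helper names `parse_sequence_block` and `check_entry` are hypothetical and are not part of PatentIn.

```python
import re

STOP_CODONS = {"taa", "tag", "tga"}


def parse_sequence_block(lines):
    """Join the residue groups of a listing block, ignoring position counts."""
    residues = []
    for line in lines:
        for token in line.split():
            # Keep only tokens made of nucleotide letters; skip numeric counters.
            if re.fullmatch(r"[acgtn]+", token.lower()):
                residues.append(token.lower())
    return "".join(residues)


def check_entry(lines, declared_length):
    """Return simple sanity flags for one listing entry."""
    seq = parse_sequence_block(lines)
    return {
        "length_ok": len(seq) == declared_length,
        "starts_with_atg": seq.startswith("atg"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
        "whole_codons": len(seq) % 3 == 0,
    }


# Toy entry (not taken from the listing): 12 bases, declared length 12.
example = ["atgaaagggt ag 12"]
print(check_entry(example, 12))
```

Passing the rows of an entry together with its <211> value flags rows where scanning dropped or duplicated residues, since the joined length then no longer matches the declared one.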

<210> 2
<211> 558
<212> DNA
<213> Streptococcus pneumoniae

<400> 2

atggctaacg taattattga aaaagctaaa gagagaatga cccagtctca ccaatcactt 60
gctcgtgaat ttggtggtat ccgtgctggt cgtgccaatg caagcttgct tgaccgtgta 120
catgtagaat actatggagt cgaaactcct cttaaccaaa tcgcttcaat tacgattcca 180
gaagcgcgtg ttttgttggt aacaccattt gacaagtctt cattgaaaga catcgaacgt 240
gccttgaacg cttctgatct tggtatcaca ccggctaatg acggttctgt gattcgcttg 300
gttatcccag ctcttacaga agaaactcgt cgtgaccttg ctaaagaagt gaagaaggtc 360
ggcgaaaatg ctaaagtggc tgtccgcaat atccgtcgcg atgctatgga cgaagctaag 420
aaacaagaaa aagcacaaga aatcactgaa gacgaattga agactcttga aaaagatatt 480
caaaaagtaa cagacgatgc tgttaaacac atcgacgaca tgactgctaa caaagagaaa 540
gaacttttgg aagtctaa 558

<210> 3 
<211> 1353 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 3 

atgggtaaat attttgggac tgatggagtc cgtggagaag ctaacctaga actaacacca 60
gaattagcct ttaaactagg acgttttgga ggctatgttc ttagtcaaca tgaaacggaa 120
gcgccgaaag tctttgtagg acgtgacaca cgtatfctcag gggaaatgct ggaatcggcc 180
ttggtggcag gtctccttt'c agtagggatt cacgtataca aacttggtgt ccttgcaaca 240
ccagcagtag cttacttggt tgaaactgaa ggagcaagtg ccggtgtcat gatttctgct 300
agccacaacc cagcccttga taacggaatc aagttctttg gcggtgatgg cttcaaacta 360
gatgatgaaa aagaagcaga aattgaagcc ttgctagatg ctgaggaaga cactcttcct 420
cgtccaagtg cagaaggctt aggaattttg gtagattatc cagaaggctt gcgtaagtat 480
gaaggatacc gtgtgtcaac tggaactcct cttgatggaa tgaaggttgc cttggataca 540
gctaatggag cagcttctac cagtgcccgt caaatctttg cagaccttgg tgcccaattg 600
acggttatcg gggaaacacc agacggtctt aacatcaacc ttaatgttgg ttcaacacat 660
ccagaagccc ttcaagaagt ggtcaaagaa agtgggtcag ctattggttt ggcctttgat 720
ggagacagtg accgcttgat tgctgttgat gagaatggtg acatcgtcga tggtgacaag 780
attatgtaca tcatcggaaa atacctttct gaaaaaggac aattggctca aaatacaatt 840
gtgacaactg ttatgtctaa ccttggtttc cacaaggcct tgaatcgcga aggtattaac 900
aaggcagtta ctgcagttgg tgaccgctac gttgttgaag aaatgagaaa atcaggctac 960
aaccttggtg gtgaacagtc tggtcacgtt atcttgatgg attacaatac cacaggtgat 1020
ggtcaattat cagcagttca attgactaaa atcatgaagg aaactggtaa gagcttatca 1080
gagttggcgg cagaagtaac gatttatcca caaaaattag ttaatatccg agtggaaaac 1140
gtcatgaagg aaaaggccat ggaagtgcca gctatcaagg ccatcatcga gaagatggaa 1200
gaagaaatgg cggggaacgg ccgtatcctt gttcgtccaa gtggaacaga gcccctcttg 1260
cgtgttatgg cagaagcgcc tacaacagaa gaagtggact actatgttga taccatcaca 1320
gatgtagttc gtgctgaaat tgggattgac taa 1353



<210> 4 
<211> 705 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 4 

atgaaaaaaa tactaattgt agatgatgag aaaccaatct cggatattat caagtttaat 60 
atgaccaagg aaggttatga agttgtaact gcttttaatg gtcgtgaagc gctagagcaa 120 
tttgaagcag agcaaccaga tattattatt ctggatttga tgcttccaga aattgatggt 180 
ttagaagttg ctaagaccat tcgtaagaca agcagtgtgc ccattcttat gctttcagcc 240 
aaagatagtg aatttgataa ggttatcggt ttggaacttg gggcagatga ctatgtaacg 300 
aaacccttct ccaatcgtga gttgcaggcg cgtgttaaag ctcttctgcg tcgttctcaa 360 
cctatgccag tagatggtca ggaagcagat agtaaacctc aacctatcca aattggggat 420 
ttagaaattg ttccagacgc ctacgtggct aaaaaatatg gcgaagaact agacttaacc 480 
catcgtgaat ttgagctttt gtatcattta gcatcgcata caggtcaagt catcacgcgc 540 
gaacacttgc ttgagactgt ctggggttat gactattttg gtgatgtccg cacagttgat 600 
gtgactgtac gacgtctgcg tgagaagatt gaagatacgc ccagccgacc agagtatatc 660 
ttgacgcgcc gtggtgtagg gtattacatg agaaataatg cttga 705 

<210> 5 
<211> 1107 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 5 

atggaagaaa ttctctgtat tggttgtgga gcaaccattc agacgacaga taaagctggt 60 
cttggtttta ccccccagtc ggcacttgaa aaaggtttgg agactggcga agtctattgc 120 
caacgctgtt tccgtctccg ccactacaat gaaatcacag atgtccagtt gacgaacgat 180 
gatttcctca agctcttgca cgaggtggga gacagtgatg ctttagtggt caatgtcatt 240 
gatatctttg attttaatgg atctgtcatc ccaggtttac cacgtttcgt ctcgggcaat 300 
gatgtcctct tggtaggaaa taaaaaagat atccttccta agtcagttaa gtctggtaag 360 
attagccagt ggctcatgaa acgtgcccat gaagaaggtc ttcgtccagt cgatgtggtc 420 
ctaacttcag cacaaaataa acatgccatt aaggaagtca ttgacaagat tgaacactac 480 
cgtaagggcc gcgatgtcta tgtggtcggt gtgaccaacg ttggaaaatc aactctaatc 540 
aatgctatta tccaagaaat cacgggtgat cagaatgtca tcactacttc acgcttccca 600 
gggacaacct tggacaaaat agagattccg cttgacgacg gatcttatat ttacgatacg 660 
ccgggaatta tccaccgcca ccagatggct cactacttga cggccaaaaa cctcaagtat 720 
gtcagtccta aaaaggaaat caagcctaag acctatcagc ttaatcctga gcaaacccta 780 
tttttaggtg gtttgggacg ctttgacttt atagcaggag aaaagcaagg atttactgct 840 
ttctttgata atgaactcaa actccatcgt agcaagcttg aaggagctag tgctttctac 900 
gataagcacc tgggaactct tctgacacca ccaaatagca aggaaaaaga agatttccca 960 
aggctagtcc agcatgtctt taccattaaa gataagacag acctagtcat ctcaggccta 1020 
ggatggattc gtgtaacagg cacagcaaaa gtcgccgtct gggcaccaga aggcgtcgcc 1080 
gtcgtcacac gaaaagcaat tatttaa 1107 

<210> 6 
<211> 1461 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 6 

atgtatccag atgatagttt gacattgcac acggacttgt accagatcaa catgatgcag 60 
gtttactttg accaagggat tcacaataag aaggcggtct ttgaggtgta tttccgccaa 120 
cagcctttta agaacggcta tgcggttttt gcaggtttgg aaagaattgt gaactatctt 180 
gaagacttgc gtttttcaga tagtgatata gcctatttgg agtcgcttgg ttatcatggg 240 
gcgttcttgg attaccttcg caatttcaag ttggagttga ccgttcgttc tgcccaagaa 300 
ggggatttgg tttttgctaa tgaaccgatt gtgcaggtgg aaggacctct agcccaatgt 360 
cagttggtcg aaacggctct tttgaacatc gtcaactacc agaccttggt ggcgacgaag 420 
gcagctcgta ttcgttcggt tatcgaagat gaacccttga tggagtttgg gacacgtcgg 480 
gctcaagaaa cggatgcggc catctgggga acacgcgcag cggtgattgg tggcgccaat 540 
ggaaccagca acgtgcgtgc gggtaagctc tttgacattc ctgttttggg aacccatgcc 600 
catgccttgg tacaggttta tggcaatgac tatgaggctt tcaaggctta cgctgcgacc 660 
cacaaaaatt gtgtctttct tgtggatacc tatgataccc ttcgcatcgg tgtaccagct 720 
gccattcagg tggcgcgtga gctgggtgat cagattaact ttatgggtgt gcggattgac 780 
tctggggata ttgcctacat ttctaagaaa gtccgtcagc aactggacga ggctggattt 840 
acagaggcta agatttatgc ttctaatgat ttggacgaaa atactatcct caatctcaag 900 
atgcaaaagg ccaagattga tgtctggggt gtgggtacca agctgattac agcctatgac 960 
cagccggctc ttggggcggt ttacaagatt gttgcaatcg aagatgaaac tggtcagatg 1020 
cgcaatacga ttaagctgtc taataatgcg gaaaaagtgt cgacgccagg taagaagcag 1080 
gtgtggcgca ttaccagtcg tgaaaaaggt aagtcagaag gtgattacat cacttatgat 1140 
ggtgtggata ttagcgacat gacagaaatc aagatgttcc atccgaccta tacatacatc 1200 
aagaagacgg ttcgtaattt tgatgccgtt cctctcttgg tggatatctt caaagaagga 1260 
atattagttt acaacttgcc tagtttgact gacattcagg attatgcccg taaagaattt 1320 
gacaagttgt gggatgagta taagcgtgtg ctcaatccgc agcactatcc agtggatttg 1380 
gcgcgtgatg tatggcaaga taagatggac ttgattgata agatgcgcaa ggaagccctt 1440 
ggtgaaggag aagaagaatg a 1461 

<210> 7 
<211> 852 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 7 

atggctacta ttcaatggtt tcctggtcac atgtctaaag ctcgtcgaca ggtgcaggag 60 
aatttaaaat ttgttgattt tgtgacgatt ttagtagatg cacgcttgcc tctatctagt 120 
caaaatccta tgttgaccaa gattgttggt gataaaccaa aactcttgat tttaaacaag 180 
gccgacttgg ctgatccagc aatgaccaag gaatggcgtc agtattttga atcacaagga 240 
atccagacgc tagctatcaa ctccaaagag caagtgactg taaaagttgt aacagatgcg 300 
gccaagaagc tcatggctga taagattgct cgccagaaag aacgtgggat tcagattgaa 360 
accttgcgta ctatgattat cgggattcca aacgctggta aatcaactct gatgaaccgt 420 
ttggctggta aaaagattgc tgttgttgga aacaagccag gggtcacaaa aggtcaacaa 480 
tggcttaaaa ccaataaaga cctggaaatc ttggatacac cggggattct ctggcctaag 540 
tttgaggatg aaactgttgc acttaagttg gcattgactg gagctatcaa ggatcagttg 600 
cttcctatgg atgaggttac catttttggt atcaattatt tcaaagaaca ttatccagaa 660 
aagctggctg aacgcttcaa acaaatgaaa attgaagaag aaccgtctgt gattattatg 720 
gatatgaccc gcgccctcgg tttccgtgat gactatgacc gtttttacag tctcttcgtg 780 
aaggaagttc gtgatggcaa actcggtaac tataccttag atacattgga agacctcgat 840 
ggcaacgatt aa 852 


<210> 8 
<211> 471 
<212> DNA 

<213> Streptococcus pneumoniae 


<400> 8 

atgattaaca atgttgtact tgtagggcgt atgacacgtg acgctgagtt gcgttatacc 60 
ccatcaaatg tagcagttgc gacttttact cttgcagtaa accgtacatt taagagtcaa 120 
aatggtgaac gtgaggctga ttttatcaat gtcgttatgt ggcgccaaca ggctgaaaat 180 
cttgctaact gggctaaaaa aggctcactt atcggggtga caggtcgtat ccagactcgt 240 
agttacgata accagcaagg acaacgtgtc tacgtgacag aggtcgtggc tgagaatttc 300 
caaatgttgg aaagccgtag tgtgcgtgag ggccacacag gtggagctta ctctgcacca 360 
actgcaaact attcagcacc tacaaattca gtaccagact tttcacgtaa tgaaaatcca 420 
tttggagcaa caaacccatt ggatatttca gatgatgatt taccattcta a 471 


<210> 9 
<211> 975 
<212> DNA 

<213> Streptococcus pneumoniae 


<400> 9 

atgaaaacgc gtattacaga attattgaag attgattatc ctattttcca aggagggatg 60 
gcctgggttg ctgatggtga tttggcaggg gctgtttcca aggctggagg attaggaatt 120 
atcggtgggg gaaatgcccc gaaagaagtt gtcaaggcca atattgataa aatcaaatca 180 

ttgactgata aaccctttgg ggtcaacatc atgctcttat ctccctttgt ggaagatatc 240 
gtggatctcg ttattgaaga aggtgttaaa gttgtcacaa caggagcagg aaatccaagc 300 
aagtatatgg aacgtttcca tgaagctggg ataatcgtta ttcctgttgt tcctagtgtc 360 
gctttagcta aacgcatgga aaaaatcggt gcagacgctg ttattgcaga aggaatggaa 420 
gctggggggc atatcggtaa attaacaacc atgaccttgg tgcgacaggt agccacagct 480 

atatctattc ctgttattgc tgcaggagga attgcggatg gtgaaggtgc tgcggctggc 540 
tttatgctag gtgcagaggc tgtacaggtg gggacacggt ttgtagttgc aaaagagtcg 600 
aatgcccatc caaactacaa ggagaaaatt ttaaaagcaa gggatattga cactacgatt 660 
tcagctcagc actttggtca tgctgttcgt gctattaaaa atcagttgac tagagatttt 720 
gaactggctg aaaaagatgc ctttaagcag gaagatcctg atttagaaat ctttgaacaa 780 

atgggagcag gtgccctagc caaagcagtt gttcacggtg atgtggaggg tggctctgtc 840 
atggcaggtc aaatcgcagg gcttgtttct aaagaagaaa cagctgaaga aatcctaaaa 900 
gatttgtatt acggagccgc taagaaaatt caagaagaag cctctcgctg gacaggagtt 960 
gtaagaaatg actaa 975 

<210> 10 
<211> 423 
<212> DNA 
<213> Streptococcus pneumoniae 

<400> 10 

atgatcgata ttcaaggaat caaagaagcc cttccccacc gttatcctat gcttctagtg 60 
gaccgtgtct tggaggtgag cgaggatacc attgttgcta tcaaaaatgt gaccatcaac 120 
gagcctttct ttaacggcca ctttcctcaa tacccagtta tgccaggtgt tctgattatg 180 
gaagccttgg cgcaaactgc cggtgtgttg gagttatcaa aacctgaaaa taaaggaaaa 240 
ctggtctttt acgctggtat ggataaggtt aagttcaaga agcaagttgt accaggcgac 300 
caattggtta tgacagcgac ttttgtaaaa cgtcgtggca ccatagctgt ggttgaagca 360 
aaggctgaag tggatggcaa gcttgcagcc agtggtaccc ttacttttgc aattgggaac 420 
taa 423 



<210> 11 
<211> 1023 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 11 

atgattaatc aaatttatca actaactaaa cctaagttta tcaatgtcaa atatcaggaa 60 
gaggctattg accaagagaa tcatatcctt atccgtccca actacatggc tgtctgtcat 120 
gcggatcagc gttactacca gggaaaacgt gatcccaaga ttttgaataa aaagcttcca 180 
atggcaatga ttcacgagtc atgtggaatc gtcatttctg acccgagcgg aacctacgag 240 
gttggtcaaa aagttgtcat gattcccaat cagtctccta tgcagagtga tgaagaattc 300 
tatgaaaact acatgacagg gacccatttc ttgtctagtg gatttgatgg ctttatgaga 360 
gagtttgttt ctctccctaa agatcgtgtg gtggcttatg atgctattga agatacggtt 420 
gcagccatta cagagtttgt cagtgtgggc atgcacgcta tgaatcgtct attgactctt 480 
gctcatagca agcgggagcg gatccccgtt attggagatg gaagtttagc ttttgtggtt 540 
gccaatatta tcaactatac tttgccagaa gcagagattg tggttattgg tcgtcattgg 600 
gaaaagttgg aactcttctc atttgccaaa gaatgctata ttacggataa tattcctgaa 660 
gagttggcct ttgaccatgc ttttgaatgt tgtggtggtg atggtactgg accagctatt 720 
aatgacttga ttcgctacat tcgtcctcag ggaacaattc tcatgatggg agttagcgaa 780 
tataaagtca atctcaatac tcgcgatgcc ttagaaaagg gcttgctctt ggttgggtca 840 
tctcgttctg gtcgcattga ttttgaaaat gctatccaaa tgatgaaagt caagaaattt 900 
gccaatcgtc ttaaaaatat cctttatcta gaagaacctg taagagaaat taaagatatt 960 
caccgtgtct ttgcaaccga tttaaacaca gcctttaaaa cagtgtttaa gtgggaagta 1020 
taa 1023 



<210> 12 
<211> 1344 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 12 

atgaacttaa aaactacttt gggccttctt gctggacgtt cttcccactt cgttttaagc 60 
cgtcttggac gtggaagtac gctcccaggg aaagtcgccc ttcaatttga taaagatatt 120 
ttacaaaacc tagctaagaa ctacgagatt gtcgttgtca ctggaacaaa tggaaaaacc 180 
ctgacaactg ccctcactgt cggcatttta aaagaggtct atggtcaagt tctaaccaat 240 
ccaagcggtg ccaacatgat tacagggatt gcaacaacct tcctaacagc caaatcttct 300 
aaaactggga aaaatattgc cgtcctcgaa attgacgaag ccagtctatc tcgtatctgt 360 
gactatatcc agcctagtct ttttgtcatt actaatatct tccgtgacca gatggaccgt 420 
ttcggtgaaa tctatactac ctataacatg atattggatg ccattcggaa agttccaact 480 
gctactgttc tccttaacgg agacagtcca cttttctaca agccaactat tccaaaccct 540 
atagagtatt ttggttttga cttggaaaaa ggaccagccc aactggctca ctacaatacc 600 
gaagggattc tctgtcctga ctgccaaggc atcctcaaat atgagcataa tacctatgca 660 
aacttgggtg cctatatctg tgagggttgt ggatgtaaac gtcctgatct cgactatcgt 720 
ttgacaaaac tggttgagtt gaccaacaat cgctctcgct ttgtcataga cggccaagaa 780 
tacggtatcc aaatcggcgg gctctataat atctataacg ccctagctgc tgtggccatc 840 
gcccgtttcc tcggcgcaga ttcccaactc atcaaacagg gatttgacaa gagccgtgct 900 
gtctttggac gccaagaaac ctttcatatc ggtgacaagg aatgtaccct tgtcttgatt 960 
aaaaatccag tcggtgcaac ccaagctatc gaaatgatca aactagcacc ttatccattt 1020 
agcctatctg tcctccttaa tgccaactat gcagatggaa ttgacactag ctggatctgg 1080 
gatgcagact ttgagcaaat cactgacatg gacattcctg aaatcaacgc tggcggtgtt 1140 
cgtcattctg aaatcgctcg tcgcctccga gtgactggct atccagctga gaaaatcact 1200 
gaaacgagta atctggagca agttctcaag accattgaga atcaagactg caagcatgcc 1260 
tatattctgg caacttatac tgccatgctg gaatttcgtg aactgctggc tagtcgtcag 1320 
attgttagaa aggagatgaa ctaa 1344 

<210> 13 
<211> 783 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 13 

atggtttata cttcactttc ctcaaaagat ggcaattacc cctatcagct caacattgcc 60 
cacctctacg gaaatctcat gaatacctac ggggacaatg gaaacatcct catgctcaag 120 
tatgtggctg aaaaactggg agcccatgtg accgttgaca tcgtttctct ccatgatgac 180 
tttgatgaaa atcactacga catcgccttt ttcggtggtg gtcaagactt tgaacaaagt 240 
atcattgcag acgacctacc tgctaaaaaa gagagGattg acaactacat ccaaaacgac 300 
ggtgtagttc tggctatctg cggtggtttc caactattgg gtcaatatta tgttgaagct 360 
tcaggaaaac gtatcgaagg gctaggggtc atgggacact acacgctcaa ccagaccaat 420 
aaccgtttta tcggtgacat caagattcac aatgaagatt tcgatgaaac ctactatgga 480 
tttgaaaatc accaaggtcg taccttcctc tctgatgacc aaaaaccgct gggacaggtt 540 
gtctatggaa atggaaacaa cgaagaaaag gtcggtgaag gggttcatta taagaatgtc 600 
tttggttcct acttccacgg gcctatcctc tctcgtaatg ccaatctggc ttatcgccta 660 
gttactactg ccctcaagaa gaaatatggt caggacatcc aactccctgc ctatgaggac 720 
attctcagcc aagaaatcgc tgaagagtac agtgacgtca aaagcaaggc tgacttttct 780 
taa 783 



<210> 14 
<211> 276 
<212> DNA 
<213> Streptococcus pneumoniae 

<400> 14 

atggcaaaca aacaagattt gatcgctaaa gtagcagaag ctacagaatt aactaagaaa 60 
gactcagcag cagcagttga agctgtattt gcagcagtag ctgactatct tgcagctggt 120 
gaaaaagttc agttgatcgg ttttagtaac tttgaagttc gtgagcgcgc agaacgtaaa 180 
ggtcgcaacc cacaaactgg taaagaaatg acaattgcag cttctaaagt accagcattc 240 
aaagctggta aagctcttaa agacgctgtt aaataa 276 

<210> 15 
<211> 840 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 15 

atgggaattg ctctagaaaa tgtgaatttt acatatcaag aaggtactcc cttagcttca 60 
gcagctttgt cggatgtttc tttgacgatt gaagatggct cttatacagc tttaattggg 120 
cacacaggta gtggtaaatc aactatttta caactcttaa atggtttatt ggtgccaagt 180 
caagggagtg tgagggtttt tgatacctta atcacctcga cttctaaaaa taaagatatt 240 
cgtcaaatta gaaaacaggt tggcttggta tttcagtttg ctgaaaatca gatttttgaa 300 
gaaacggttt tgaaggacgt tgcttttgga ccgcaaaatt ttggagtttc tgaagaagat 360 
gctgtgaaga ctgcgcgtga gaaactggct ctggttggaa ttgatgaatc actttttgat 420 
cgtagtccgt ttgagctgtc agggggacaa atgagacgtg ttgccattgc aggcatactt 480 
gccatggagc cagctatatt agtcttagat gagccaacag ctggtctaga tcctctaggg 540 
agaaaagagt tgatgaccct gttcaaaaaa ctccaccagt cagggatgac catcgtcttg 600 
gtaacgcatt tgatggatga tgttgctgaa tatgcgaatc aagtctatgt aatggaaaag 660 
ggacgtttag taaagggggg caaaccaagt gatgtctttc aagacgttgt ttttatggaa 720 
gaagttcagt tgggagtacc taaaattacg gccttttgta aacgattggc tgatagaggc 780 
gtgtcattta aacgattacc ggttaagata gaggagttca aggagtcgct aaatggatag 840 

<210> 16 
<211> 930 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 16 

atggatattc aatttttagg aacgggggct ggtcagccct ctaaagcccg caacgtttca 60 
agtctcgccc tgaaactttt ggatgagatt aacgaagttt ggctctttga ctgtggagaa 120 
ggtacgcaaa atcgcattct ggaaaccaca attcgaccac gtaaggtcag caaaatcttt 180 
attacccatc tgcatggaga ccacattttt ggtttgccag gtttcctttc tagccgtgcc 240 
tttcaggcca atgaagagca gacagatttg gaaatctacg gacctcaagg aatcaagtca 300 
tttgtcttaa ccagccttcg tgtgtcaggt tctcgtctgc cctaccgcat tcatttccat 360 
gagtttgacc aagattctct aggtaaaatt cttgaaatcg ataaattcac tgtgtatgca 420 
gaggagctgg accacactat tttctgtgtt ggctatcgtg tcatgcaaaa ggatctagaa 480 
gggacgctgg atgctgaaaa actcaaggct gctggtgttc cgttcggccc gctttttggt 540 
aaaatcaaaa atggccagga tcttgttttg gaagacggaa ctgaaatcaa ggcagcagac 600 
tatatctcag cgccacgtcc aggtaagatt atcactattt taggagacac tcgaaaaacg 660 
gatgccagtg tgcgtctggc tgtcaatgca gatgtcctag ttcatgagtc cacttatggc 720 
aagggtgatg aaaaaattgc tcgtaaccat ggtcactcaa ctaatatgca agctgcacaa 780 
gtagcggtag aagcaggtgc caaacgcctc ctactcaacc atatcagtgc ccgtttcctc 840 
tcaaaagata ttagcaaact caagaaagac gctgccacaa tttttgaaaa tgtccatgtg 900 
gtcaaagact tggaagaagt ggaaatctag 930 

<210> 17 
<211> 1662 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 17 

atgagtaata tcagtttaac aacacttggt ggtgtgcgtg agaatggaaa aaatatgtac 60 
attgctgaaa ttggagagtc catttttgtt ttgaatgtag ggttaaaata tcctgaaaat 120 
gaacaattag gggtcgatgt ggtgattcca aacatggatt acctttttga aaatagcgac 180 
cgtattgctg gggttttctt gacccacggg catgcggatg cgattggtgc tctaccttat 240 
ctcttggcag aggctaaagt tcctgtattt gggtctgagt tgaccattga gttggcaaag 300 
ctctttgtca aaggaaatga tgccgttaag aaatttaatg atttccatgt cattgatgag 360 
aatacggaga ttgattttgg tgggacagtg gtttccttct tccctacgac ttactccgtt 420 
ccagagagtc tgggaattgt cttgaagaca tcggaaggaa gcatcgttta tacaggtgac 480 
ttcaaatttg accaaacggc tagtgaatct tatgcaactg attttgctcg tttggcagag 540 
attggtcgtg acggcgtcct ggctctcctc agtgattcgg ccaatgcaga cagcaatatt 600 
caggtggcta gtgaaagtga agttagggat gaaattaccc aaactattgc tgactgggaa 660 
ggtcgtatca tcgttgcagc tgtttccagt aatctttctc gtattcagca gatttttgac 720 
gctgcggata aaacaggtcg acgtatcgtc ttgacaggat ttgatattga aaatatcgtc 780 
cgcacagcga ttcgtcttaa gaagttgtct ttagccaacg aaattctctt gattaagcct 840 
aaagatatgt ctcgctttga agaccatgag ttgattattc ttgagacagg tcgtatgggt 900 
gaacctatca atggacttcg taagatgtcg attggtcgcc atcgttatgt agaaatcaag 960 
gatggggacc tggtctatat tgctacggct ccgtctattg ctaaagaagc ctttgttgcg 1020 
cgtgtagaaa atatgattta tcaggcaggt ggggttgtca aattgattac ccaaagttta 1080 
catgtatcag ggcacggaaa tgtgcgtgat ttgcagctga tgatcaatct tttgcaacct 1140 
aagtacctct tccctgtcca aggggagtat cgtgagttgg atgctcacgc taaggctgcc 1200 
atggcagttg ggatgttgcc agaacgcatc ttcattccta aaaaggggac gaccatggct 1260 
tacgagaatg gagactttgt 'tcc>igptgga tcggtttcag caggagatat cttaattgat 1320 
gggaatgcca ttggtgatgt tggaaatgtt gttcttcgtg accgtaaggt cttgtcagag 1380 
gatggaattt tcatcgtggc tattacagtc aaccgtcgtg agaagaaaat tgtggctaga 1440 
gctcgtgttc acacgcgtgg atttgtiftat ctcaagaaga gtcgcgatat tctccgtgaa 1500 
agttcagaat tgattaacca aacggtagaa gattatcttc aaggagatga ctttgactgg 1560 
gcagatctta aagggaaggt tcgagataat ttgaccaagt atctctttga ccaaaccaag 1620 
cgtcgtccag ctattttacc agtagtcatg gaagcaaaat aa 1662 

<210> 18 
<211> 951 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 18 

atgacaaaag aatttcatca tgtaacggtc ttactccacg aaacgattga tatgcttgac 60 
gtaaaacctg acggtatcta cgttgatgcg actttgggtg gagcaggcca tagcgagtat 120 
ttattaagta aattaagtga aaaaggccat ctctatgcct ttgaccagga tcagaatgcc 180 
attgacaatg cgcaaaaacg cttggcacct tacattgaga agggagtggt gacctttatc 240 
aaggataact tccgtcattt acaggcacgt ttgcgcgaag ctggtgttca ggaaattgat 300 
ggaatttgtt atgacttggg agtgtctagt cctcaattgg accagcgtga gcgtggtttt 360 
tcttataaaa aggatgcgcc actggacatg cggatgaatc aggatgctag tctgacagcc 420 
tatgaagtgg ttaatcatta tgactatcat gatttggttc gtattttctt caaatacggt 480 
gaggataaat tctctaaaca gattgcgcgt aagattgagc aagcgcgtga agtgaagccg 540 
attgagacaa cgactgagtt agcagagatt atcaagttgg tcaaacctgc caaggaactc 600 
aagaagaagg gtcatcctgc taagcagatt ttccaggcta ttcgaattga agtcaatgat 660 
gaactggggg cggcagatga gtccatccag caggctatgg atatgttggc tctggatggt 720 
agaatttcag tgattacctt tcattcctta gaagaccgct tgaccaagca attgttcaag 780 
gaagcttcaa cagttgaagt tccaaaaggc ttgcctttca tcccagatga tctcaagccc 840 
aagatggaat tggtgtcccg taagccaatc ttgccaagtg cggaagagtt agaagccaat 900 
aaccgctcgc actcagccaa gttgcgcgtg gtcagaaaaa ttcacaagta a 951 

<210> 19 
<211> 999 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 19 

atgagtagaa ttttagataa tgagataatg ggggatgagg agttagtaga acgcacgctc 60 
cgtcctcagt atttacgtga atatatcgga caggataagg tcaaggacca gctacaaatc 120 
tttattgaag ctgccaaaat gcgggatgaa gcgctggatc atgtgctctt atttgggcct 180 
ccaggtctcg ggaaaacgac catggccttt gttattgcca acgaactggg agtcaatctt 240 
aagcagacgt cgggtccagt cattgaaaaa gccggagatc tggtagctat tttgaatgag 300 
ttagagcctg gggatgtact ttttattgat gagatccatc gtttgccaat gtcagtggaa 360 
gaggtgcttt atagtgctat ggaggacttc tacatcgata ttatgattgg ggctggtgag 420 
ggtagtcgta gtgttcattt ggagttacca ccttttacct tgattggtgc gacgactcgg 480 
gctggtatgc tctccaatcc gctacgggca cgttttggga ttacaggcca tatggagtat 540 
tatgcccatg ctgacttgac agaaattgtc gagcggacgg cagatatttt tgagatggaa 600 
atcactcatg aggcagcatc tgagttggcc ctacgtagtc gtgggacccc tcgtattgcc 660 
aatcgtctcc tcaagcgcgt gcgcgatttt gcccagataa tggggaatgg ggtaattgat 720 
gatattatta ccgataaggc tttgactatg ctggatgttg accatgaagg tttggactat 780 
gtggatcaaa aaatccttcg taccatgatt gagatgtaca gtggaggacc tgttggtcta 840 
ggaactcttt ctgtgaatat cgccgaagag cgtgagacag ttgaagacat gtatgagcct 900 
tacttgattc aaaaaggttt tatcatgcgg acacggtctg gacgggtggc gactgctaag 960 
gcatatgagc acttaggtta tgaatacagt gaaaaataa 999 

<210> 20 
<211> 1311 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 20 

atgagtatgt ttttagatac agctaagatt aaggtcaagg ctggtaatgg tggcgatggt 60 
atggttgcct ttcgtcgtga aaaatatgtc cctaatggag gcccttgggg tggtgatggt 120 
ggtcgtggag gcaatgtggt cttcgttgta gacgaaggac tacgtacctt gatggatttc 180 
cgctacaatc gccatttcaa ggctgattct ggtgaaaaag ggatgaccaa agggatgcat 240 
ggtcgtggtg ctgaggacct tagagttcga gtatcacaag gtacgactgt tcgtgatgcg 300 
gagactggca aggttttaac agatttgatt aaacatgggc aagaatttat cgttgcccac 360 
ggtggtcgtg gtggacgtgg aaatattcgt tttgcgacac caaaaaatcc tgcaccggaa 420 
atctctgaaa atggagaacc aggtcaggaa cgtgagttac aattggaact aaaaatcttg 480 
gcagatgtcg gtttagtagg attcccatct gtagggaagt caacactttt aagtgttatt 540 
acctcagcta agcctaaaat tggtgcctac cactttacca ctattgtacc aaatttaggt 600 
atggttcgca cccaatcagg tgaatccttt gcagtagccg acttgccagg tttgattgaa 660 
ggggctagtc aaggtgttgg tttgggaact cagttcctcc gtcacatcga gcgtacacgt 720 
gttatccttc acatcattga tatgtcagct agcgaaggcc gtgatccata tgaggattac 780 
ctagctatca ataaagagct ggagtcttac aatcttcgcc tcatggagcg tccacagatt 840 
attgtagcta ataagatgga catgcctgag agtcaggaaa atcttgaaga atttaagaaa 900 
aaattggctg aaaattatga tgaatttgaa gagttaccag ctatcttccc aatttctgga 960 
ttgaccaagc aaggtctggc aacactttta gatgctacag ctgaattgtt agacaagaca 1020 
ccagaatttt tgctctacga cgagtccgat atggaagaag aagcttacta tggatttgac 1080 
gaagaagaaa aagcctttga aattagtcgt gatgacgatg cgacatgggt actttctggt 1140 
gaaaaactca tgaaactctt taatatgacc aactttgatc gtgatgaatc tgtcatgaaa 1200 
tttgcccgtc agcttcgtgg tatgggggtt gatgaagccc ttcgtgcgcg tggagctaaa 1260 
gatggggatt tggtccgcat tggtaaattt gagtttgaat ttgtagacta g 1311 

<210> 21 
<211> 519 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 21 

atgaactact ttaatgttgg gaaaatcgtt aatacgcagg gattacaggg tgagatgcga 60 
gtcttgtctg tgacggattt tgcagaagaa cggtttaaaa aaggagctga gctggctttg 120 
tttgatgaaa aagatcagtt tgtccaaaca gtgaccatcg ctagccaccg taaacagaag 180 
aactttgaca ttattaaatt caaagatatg taccatatca atactatcga aaagtacaag 240 
ggatacagtc tcaaggtcgc tgaggaagat ttgaatgacc tagacgatgg tgaattttac 300 
tatcacgaga ttatcggttt ggaagtctat gagggtgata gcttggttgg aaccatcaag 360 
gaaatcctgc aaccaggtgc taatgatgtc tgggtggtca aacgaaaagg caaacgtgat 420 
ttgcttttac cttatatccc accagtggtt ctcaatgttg atattccaaa taaacgggtc 480 
gatgtggaaa tcttagaagg gttagacgat gaagattga 519 

<210> 22 
<211> 720 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 22 

atgaagattg atattttaac cctctttcca gagatgtttt ctccactgga gcactcaatc 60 
gttggaaagg ctcgagaaaa agggctcttg gatatccagt atcataattt tcgagaaaat 120 
gctgaaaagg cccgtcatgt agatgatgag ccctacggag gcggtcaggg catgttgctc 180 
agagcacaac ctattttcga ttcctttgat gctattgaaa agaaaaatcc gcgcgttatt 240 
ctcctcgatc ctgctggaaa gcagtttgat caggcttatg ctgaagattt ggctcaagag 300 
gaagagctaa tctttatctg tgggcactat gagggttatg atgagcgcat taagaccttg 360 
gtaacagatg agatttccct aggcgactat gtcctcactg gtggagaatt ggcagctatg 420 
accatgattg atgctacagt tcgcctgatt ccagaagtga ttggcaagga gtctagccac 480 
caagatgata gtttttcttc aggtctttta gaatatcatc agtacacacg tccctatgat 540 
tatcgaggca tggtcgtgcc agatgtattg atgagtggcc atcatgaaaa gattcgtcag 600 
tggcgattgt acgagagttt aaagaaaacc tacgagcgca gaccggattt acttgaacat 660 
tatcaactga cagtagaaga agaaaaaatg ctggcagaaa tcaaagaaaa caaagaataa 720 

<210> 23 
<211> 561 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 23 

atgattgaag caagtaaatt aaaagctggt atgacctttg aaacagctga cggcaaattg 60 
attcgcgttt tggaagctag tcaccacaaa ccaggtaaag gaaacacgat catgcgtatg 120 
aaattgcgtg atgtccgtac tggttctaca tttgacacaa gctaccgtcc agaggaaaaa 180 
tttgaacaag ctattatcga gactgtccca gctcaatact tgtacaaaat ggatgacaca 240 
gcatacttca tgaatacaga aacttacgac cagtacgaaa tccctgtagt caatgttgaa 300 
aacgaattgc tttacatcct tgaaaactct gatgtgaaaa tccaattcta cggaactgaa 360 
gtgatcggtg tcaccgttcc tactactgtt gagttgacag ttgctgaaac tcaaccatct 420 
atcaaaggtg ctactgttac aggttctggt aaaccagcaa cgatggaaac tggacttgtc 480 
gtaaacgttc cagacttcat cgaagcagga caaaaactcg ttatcaacac tgcagaagga 540 
acttacgttt ctcgtgccta a 561 



<210> 24 
<211> 1572 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 24 

atggcatttg aaagtttaac agaacgtttg cagaacgtct ttaaaaatct acgtaaaaaa 60 
ggaaaaatct ctgaatctga tgtccaagag gcaaccaaag aaattcgctt ggccttgctc 120 
gaggccgacg ttgccttgcc tgttgtaaag gactttatca agaaagttcg tgagcgtgca 180 
gtcgggcatg aggtcattga tacacttaat cctgcgcaac agattattaa aatcgttgat 240 
gaggaactga cagccgtttt aggttctgat acggcagaaa ttatcaagtc acctaagatt 300 
ccaaccatca tcatgatggt tggtttacaa ggggctggta aaacaacctt tgctggtaaa 360 
ttggccaaca aactcaagaa agaagaaaat gctcgtcctt tgatgattgc ggcggatatt 420 
tatcgtccag ctgccattga ccagcttaag accttgggac aacagattga tgtgcctgtc 480 
tttgcacttg gaacagaagt accagctgtt gagattgtac gtcaaggttt ggagcaagcc 540 
caaactaatc ataacgacta tgtcttgatt gatactgcgg gtcgtttgca gattgatgag 600 
ctcctcatga atgagcttcg tgatgtgaaa acattggctc aaccaaatga aatcttgctt 660 
gtcgttgatg ctatgattgg tcaggaagca gccaatgttg cgcgtgagtt taatgctcag 720 
ttggaagtga ctggggtcat ccttaccaag attgatggcg atactcgtgg tggtgctgct 780 
ctgtctgttc gtcacattac tggaaaacca atcaagttca ctggtacagg tgaaaagatt 840 
acggacattg aaaccttcca cccagaccgc atgtctagcc gtatccttgg tatgggggat 900 
atgctcactt tgattgagaa agcttctcag gaatacgatg aacaaaaagc ccttgaaatg 960 
gctgagaaga tgcgcgaaaa cacctttgat tttaatgatt tcatcgatca attagatcag 1020 
gtgcaaaata tggggccgat ggaagacttg ctcaagatga ttccaggtat ggcaaacaat 1080 
ccagcccttc aaaacatgaa ggtggatgaa cgccagattg ctcgtaaacg tgccattgtg 1140 
tcttcgatga cacctgaaga gcgtgaaaac ccagatttgt taaatccaag ccgtcgccgt 1200 
cgtattgctg ctggttctgg aaatacattc gtcgaagtca ataaattcat caaggacttt 1260 
aaccaggcta aacagctcat gcagggtgtt atgtctgggg atatgaataa aatgatgaag 1320 
caaatgggga ttaatccaaa taaccttcct aaaaatatgc caaatatggg aggaatggat 1380 
atgtctgccc ttgaaggaat gatgggacaa ggcggtatgc ctgacttatc agctctcgga 1440 
ggagcaggaa tgccagatat gagccagatg tttggtggcg gtttgaaagg taaaattggt 1500 
gaatttgcta tgaaacagtc catgaaacgt atggctaaca aaatgaagaa agcgaagaag 1560 
aaacgcaagt aa 1572 

<210> 25 
<211> 846 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 25 

atgtatctta ttgaaatttt aaaatctatc ttcttcggga ttgttgaagg aattacggaa 60 
tggttgccga tttccagtac aggtcatttg attttagcag aggagtttat ccaataccaa 120 
aatcaaaatg aagcctttat gtccatgttt aatgtcgtga ttcagcttgg tgctatttta 180 
gcagttatgg tgatttattt taacaagctc aatcctttta aaccgactaa ggacaaacag 240 
gaagttcgta agacttggag actatggttg aaggtcttga ttgctacttt acctttactt 300 
ggtgtcttta aatttgatga ttggtttgat acccacttcc ataacatggt ttcagttgct 360 
ctcatgttga ttatctacgg ggttgccttc atctatttgg aaaagcgcaa taaagcgcgt 420 
gctatcgagc caagtgtaac agagttggac aagcttcctt atacgaccgc tttctatatc 480 
ggactcttcc aagttcttgc tcttttacca gggactagcc gttcaggtgc aacgattgtc 540 
ggtggtttgt taaatggaac cagtcgttca gttgtgacag aatttacctt ctatcttggg 600 
attcccgtta tgtttggagc tagtgcctta aagattttca aatttgtgaa agccggagaa 660 
ctcttgagct ttgggcaatt gtttttgctc ttggtcgcga tgggagtagc ttttgcggtc 720 
agcatggtgg ctattcgctt cttgaccagc tatgtgaaaa aacacgactt cacccttttt 780 
ggtaaatacc gtatcgtgct tggtagtgtt ttgctacttt acagttttgt ccgtttattt 840 
gtataa 846 

<210> 26 
<211> 1290 
<212> DNA 
<213> Streptococcus pneumoniae 

<400> 26 

atgggattat ttgaccgtct attcggaaaa aaagaagaac ctaaaatcga agaagttgta 60 
aaagaagctc tggaaaatct tgatttgtct gaagatattg agcctgcctt cacagaagct 120 
gaggaagttt ctcaagaaga agcagaggtt gaaagttctg aagaatctgt gttccaagaa 180 
gaggatagtc aagacacagt cgaagaaaat ctggatttag agccagttgt agaggtttct 240 
caagaagaag tagaagaatt tccaaactca caagaagtca cagaggaaga gaagcttgag 300 
cacgaaggaa ctgtagaaga aaataatttt gaagtgcttg aaccagaagc tcctcaaaca 360 
gaagaaactg ttcaggaaaa atatgaccgc agtcttaaga aaactcgcac aggtttcggt 420 
gcccgcttga atgccttctt tgctaacttc cgctctgttg acgaagaatt tttcgaggaa 480 
ctggaagaac tgttgattat gagtgatgtt ggtgtccaag tcgcttctaa cttaacggag 540 
gaactacgtt acgaagccaa gcttgaaaat gccaagaaac ctgatgcact tcgtcgtgtc 600 
atcattgaga aattggttga gctttatgaa aaggatggta gctacgatga aagcatccac 660 
ttccaagata acttgacagt tatgctcttt gttggtgtga atggtgttgg gaaaacaact 720 
tctatcggaa aactagccca ccgctacaaa cgagctggta agaaggtcat gctggttgca 780 
gcagatacct tccgtgcggg tgcagtagct cagctagctg aatggggccg acgagtagat 840 
gttccagtag taactggacc tgaaaaagct gatccagcca gcgtggtctt tgatggtatg 900 
gaacgtgccg tggctgaagg tatcgatatt ctcatgattg atactgctgg tcgtctgcaa 960 
aataaggata accttatggc tgagttggaa aagattggtc gtattatcaa acgtgttgtg 1020 
ccagaagcac cacatgaaac cttcttggca cttgatgcat caacaggtca aaatgcccta 1080 
gtacaggcca aagaattttc gaaaatcaca cctttaacgg gaattgtttt gactaagatt 1140 
gatggaactg ctcgaggagg tgtggttcta gccattcgtg aagaactcaa tattcctgta 1200 
aaattgattg gttttggtga aaaaatcgat gatattggag agtttaactc agaaaacttc 1260 
atgaaaggtc tcttggaagg tttaatctaa 1290 


<210> 27 
<211> 498 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 27 

atgtatattg aaatggtaga tgaaactggt caagtttcaa aagaaatgtt gcaacaaacc 60 
caagaaattt tggaatttgc agccaaaaaa ttaggaaaag aagacaagga gatggcagtc 120 
acttttgtga ccaatgagcg tagtcatgaa cttaatctgg agtaccgtga caccgaccgt 180 
ccgacagatg tcatcagcct tgagtataaa ccagaattgg aaattgcctt tgacgaagag 240 
gatttgcttg aaaatccaga attggcagag atgatgtctg agtttgatgc ctatattggg 300 
gaattgttca tctctatcga taaggctcat gagcaggccg aagaatatgg tcacagcttt 360 
gagcgtgaga tgggcttctt ggcagtacac ggctttttac atattaacgg ctatgatcac 420 
tatactccgg aagaagaagc ggagatgttc ggtttacaag aagaaatttt gacagcctat 480 
ggactcacaa gacaataa 498 

<210> 28 
<211> 768 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 28 

atgagtattc gagtaattat tgccggtttt aagggaaaga tgggccaggc tgcttgtcag 60 
atggtattga ctgatccaga cttggacttg gtggcagttt tggatccttt tgagtctgag 120 
tcagaatggc agggtattcc tgttttcaag gataaggctg atttagctgg ttttgaagcg 180 
gatgtctggg tagattttac tactccagct gttgcctacg aaaatacacg ttttgctctt 240 
gaaaatggct ttgctccagt agttggaacg actggtttca cgagtgaaga aattgcagag 300 
ctaaaagaat tttctcgtgc ccaagacttg ggtggcctga ttgcccctaa ctttgccttg 360 
ggtgctgtct tactcatgca atttgcgacg caggctgcca aatatttccc aaatgtggag 420 
attattgagc tccatcatga caagaaaaag gatgctccga gtggaacagc cattaaaaca 480 
gctgagttga tggcagaggt tcgagagtca attcagcaag gtgcagcaga tgaggaagag 540 
ctgattgctg gtgctcgtgg tgctgacttt gatggtatgc gcatccactc agttcgtttg 600 
ccaggcttgg tagctcatca ggaagtcatc tttggcaatc agggagaagg gttgaccctc 660 
cgtcatgact cctatgatcg catctccttc atgacaggag tcaatttggg aattaaagaa 720 
gttgtcaagc gtcatgagct tgtctatgga ttagaacact tattatga 768 

<210> 29 
<211> 276 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 29 

atggcaaaca aacaagattt gatcgctaaa gtagcagaag ctacagaatt aactaagaaa 60 
gactcagcag cagcagttga agctgtattt gcagcagtag ctgactatct tgcagctggt 120 
gaaaaagttc agttgatcgg ttttagtaac tttgaagttc gtgagcgcgc agaacgtaaa 180 
ggtcgcaacc cacaaactgg taaagaaatg acaattgcag cttctaaagt accagcattc 240 
aaagctggta aagctcttaa agacgctgtt aaataa 276 

<210> 30 
<211> 921 
<212> DNA 
45 <213> Streptococcus pneumoniae 

<400> 30 

atgactaaaa cagccttttt atttgctggt caaggtgccc agtatctagg gatgggacgg 60 

gatttctatg atcagtatcc gattgttaaa gaaacgattg atcgagcgag tcaggtgctc 120 

50 ggttatgatt tgcgttatct catcgatacg gaagaggaca aactcaatca gacccgctat 180 

acgcaaccag ccattctagc gacttcggtt gctatctacc gtttattgca agaaaagggc 240 

tatcagcctg atatggtcgc tggtttgtct cttggagaat actctgcctt ggtggcaagc 300 

ggcgccttgg attttgaaga tgcggttgcc ttggtagcta agcgtggagc ctatatggaa 360 

gaagcggctc ctgctgactc tggcaagatg gtagcagttc tcaatacgcc agtagaggtc 420

attgaagaag cctgtcaaaa agcttctgaa cttggagtgg ttactccagc caactataac 480

acacctgcac aaatcgtcat tgctggagaa gtggttgcag ttgatcgagc ggttgaactt 540 

ttgcaagaag caggtgccaa acgcttgatt cctcttaagg tgtcaggtcc ctttcacacc 600 
gctctccttg agccagctag ccagaaacta gctgaaactc tagctcaggt aagtttttca 660

gattttactt gtcccctagt cggcaataca gaagctgctg tgatgcaaaa agaggacatt 720 

gctcagctct tgacgcgtca ggtcaaggaa cccgttcgtt tctatgaaag tattggggtc 780 

atgcaagaag caggcataag caactttatc gagattggac cggggaaagt cttgtcaggt 840

tttgttaaaa aaattgatca aactgctcac ttagctcatg tggaagatca agcgagttta 900

gtagcacttt tagaaaaata g 921 

<210> 31 
<211> 732 
<212> DNA

<213> Streptococcus pneumoniae 

<400> 31 

atgaaactag aacataaaaa tatctttatt acaggttcga gtcgtggaat tggtcttgcc 60 
atcgcccaca agtttgctca agcaggagcc aacattgtct taaacagtcg tggggcaatc 120
tcagaagaat tgctcgctga gttttcaaac tatggtatca aggtggttcc catttcagga 180 
gatgtatcag attttgcaga cgctaagcgt atgattgatc aagctattgc agaactgggt 240 
tcagtagatg ttttggtcaa caatgcaggg attacccaag atactcttat gctcaagatg 300
acagaagcag attttgaaaa agtgctcaag gtcaatctga ctggtgcctt taatatgaca 360 
caatcagtct tgaaaccgat gatgaaagcc agagaaggtg ctatcattaa tatgtctagt 420
gttgttggtt tgatggggaa tattggtcaa gctaactatg ctgcttctaa ggctggcttg 480 
attggcttta ccaagtctgt ggcacgcgag gtcgctagtc ggaatatacg agtcaatgtg 540
attgctccag gaatgattga gtctgatatg accgctatct tatcagataa gattaaggaa 600 
gctacactag ctcagattcc gatgaaagaa tttgggcagg cagagcaggt tgcagatttg 660 
acagtatttt tagcaggcca agattatcta actggtcaag tggttgccat tgatggtggc 720
ttaagtatgt ag 732 

<210> 32 

<211> 831 

<212> DNA

<213> Streptococcus pneumoniae 

<400> 32 

atgggagtga aaaagaaact aaagttgact agtttgctag gactgtctct gttaatcatg 60 
acagcctgtg cgactaatgg ggtaactagc gatattacag ccgaatcggc tgatttttgg 120
agtaaattgg tttacttctt tgcggaaatc attcgctttt tatcgtttga tattagtatc 180 
ggagtgggga ttattctctt tacggtcttg attcgtacag tcctcttgcc agtctttcag 240
gtgcaaatgg tggcttctag gaaaatgcag gaagctcagc cacgcattaa ggcgcttcga 300 
gaacaatatc caggtcgaga tatggaaagc agaaccaaac tagagcagga aatgcgtaaa 360 
gtatttaaag aaatgggtgt cagacagtca gactctcttt ggccgatttt gattcagatg 420
ccggttattt tggccctgtt ccaagcccta tcaagagttg actttttaaa gacaggtcat 480
ttcttatgga ttaaccttgg tagtgtggat acaacccttg ttcttccgat tttagcagca 540
gtattcacct ttttaagtac ttggttgtcc aacaaagctt tgtctgagcg aaatggcgct 600 
acgactgcga tgatgtatgg gattccagtc ttgattttta tctttgcagt ttatgcgcca 660 
ggtggagtcg ccctatactg gacagtgtct aatgcttatc aagtcttgca aacctatttc 720
ttgaataatc cattcaagat tatcgcagag cgcgaggccg tagtacaggc acaaaaagat 780 
ttggaaaata gaaaaagaaa agccaagaaa aaggctcaga aaacgaaata a 831 

<210> 33
<211> 1230
<212> DNA

<213> Streptococcus pneumoniae
<400> 33

atgaagatta gtaagaggca cttattaaat tattccatct tgattcccta cttactttta 60
tctattttgg gcttgattgt ggtctattcg accaccagtg ctattttaat tgaagaaggc 120
aagagcgcct tgcagttggt tcgaaaccaa ggaatctttt ggattgttag tttgatactg 180
attgccttaa tttataaatt gagactagat tttttgagaa atgagcgact aatcatttta 240
gttatattaa tagaaatgct tttattgttc ttggctcgtt ttattggtat ttcagtaaac 300
ggggcatacg gttggatttc ggttgcagga gtaactattc agccagctga gtacttaaaa 360
atcattatta tttggtattt agctcaccga ttctccaaac agcaagaaga aatagctact 420
tatgattttc aagttttgac tcaaaatcaa tggcttcccc gtgcttttaa tgattggcga 480
ttcgttctcc tagttctgat tggaagtttg ggaattttcc ctgatttagg aaatgcgact 540
attttagtct tggtttcctt gattatgtat acagttagtg gaatcgctta tcgctggttt 600
tcaaccattc tggcgctcgt atctgccact tctgtctttg tcttgaccac tatcagccta 660
atcggtgttg agaccttttc aaaaattcca gtatttggct atgtagccaa gcgctttagt 720
gcctttttta atccttttgc cgatcgtgct gatgcaggtc accagttagc taattcttat 780
tttgccatgg tcaatggtgg ttggtttggt ctaggtcttg gaaactcgat tgaaaaacga 840
ggttatttgc cagaagctca tacagacttt gtcttttcta tcgtgattga agaatttggc 900
tttgttggtg ccagtcttat tttagctctc ttgtttttca tgattttgcg gattatcttg 960
gtcggtatcc gagcggagaa tcctttcaat gccatggttg cactcggtgt cggagggatg 1020
atgttggttc aggtatttgt caatatcgga gggatttcgg gcttgattcc atctacagga 1080
gtgactttcc ctttcttatc ccagggtgga aatagtcttc tagtcttatc agtggcagta 1140
gcctttgtct taaatattga tgccagtgaa aaacgcgcta agttgtaccg agaattggaa 1200
aatcaaccaa tgaaccttct gttgaagtag 1230

<210> 34
<211> 1260
<212> DNA

<213> Streptococcus pneumoniae

<400> 34

atgctcggaa ttttaacctt tattctggtt tttgggatta ttgtagtggt gcacgagttc 60
gggcacttct actttgccaa gaaatcaggg attttagtac gtgaatttgc catcggtatg 120
ggacctaaaa tctttgctca cattggcaag gatggaacgg cctataccat tcgaatcttg 180
cctctgggtg gctatgtccg catggccggt tggggtgatg atacaactga aatcaagaca 240
ggaacgcctg ttagtttgac acttgctgat gatggtaagg ttaaacgcat caatctctca 300
ggtaaaaaat tggatcaaac agccctccct atgcaggtga cccagtttga ttttgaagac 360
aagctcttta tcaaaggatt ggttctggaa gaagaaaaaa catttgcagt ggatcacgat 420
gcaacggttg tggaagcaga tggtactgag gttcggattg cacctttaga tgttcaatat 480
caaaatgcga ctatctgggg caaactgatt accaattttg caggtcctat gaacaatttt 540
atcttaggtg tcgttgtttt ttgggtttta atctttatgc agggtggtgt cagagatgtt 600
gataccaatc agttccatat catgccccaa ggtgccttgg ccaaggtagg agtaccagaa 660
acggcacaaa ttaccaagat cggctcacat gaggttagca actgggaaag cttgatccaa 720
gctgtggaaa cagaaaccaa agataagacg gcaccgactt tggatgtgac tatttctgaa 780
aaggggagtg acaaacaagt cactgttaca cccgaagata gtcaaggtcg ttaccttcta 840
ggtgttcaac cgggggttaa gtcagatttt ctatccatgt ttgtaggtgg ttttacaact 900
gctgctgact cagctctccg aattctctca gctctgaaaa atctgatttt ccaaccggat 960
ttgaacaagt tgggtggacc tgttgctatc tttaaggcaa gtagtgatgc tgctaaaaat 1020
ggaattgaga atatcttgta cttcttggca atgatttcca tcaatattgg gatttttaat 1080
cttattccga ttccagcctt ggatggtggt aagattgtgc tcaatatcct agaagccatc 1140
cgccgcaaac cattgaaaca agaaattgaa acctatgtca ccttggccgg agtggtcatc 1200
atggttgtct tgatgattgc tgtgacttgg aatgacatta tgcgactctt ttttagataa 1260



<210> 35 
<211> 594 
<212> DNA

<213> Streptococcus pneumoniae 

<400> 35 

atgtacgcat atttaaaagg aatcattacc aaaattactg ccaaatacat tgttcttgaa 60 

accaatggta ttggttatat cctgcatgtg gccaatcctt atgcctattc aggtcaggtt 120

aatcaggagg ctcagattta tgtgcatcag gttgtgcgtg aggacgccca tttgctttat 180 

ggatttcgct cagaggatga gaaaaagctc tttcttagtc taatttcggt ctctgggatt 240 
ggtcctgtat cagctcttgc tattatcgct gctgatgaca atgctggctt ggttcaagcc 300

attgaaacca agaacatcac ctacttgacc aagttcccta aaattggcaa gaaaacagcc 360 

cagcagatgg tgctggactt ggaaggcaag gtagtagttg caggagatga ccttcctgcc 420 

aaggtcgcag tgcaagcaag tgctgaaaac caagaattgg aagaagctat ggaagccatg 480

ttggctctgg gctacaaggc aacagagctc aagaaaatca agaaattctt tgaaggaacg 540

acagatacag ctgagaacta tatcaagtcg gcccttaaaa tgttggtcaa atag 594 

<210> 36 

<211> 774 

<212> DNA

<213> Streptococcus pneumoniae 

<400> 36

atgaagaata atcgtatttt agcactttct ggaaatgata tttttagtgg tggtggactg 60 

tcagctgatt tggctaccta taccttgaac ggcttgcatg ggtttgtagc agtgacttgt 120

ttgacagcct tgacagaaaa aggatttgaa gtctttccaa ctgatgatac catttttcaa 180 

catgaattag atagcttgcg tgatgtggaa tttgggggaa ttaagattgg tcttctccct 240 

actgtcagtg tggctgagaa ggccttggac tttatcaaac aacgcccagg agtacctgtg 300 

gtgttggatc ctgtcttggt ctgcaaggaa acgcatgatg tagctgtcag tgagctctgc 360 

caagagttga ttcgcttctt cccttatgtc agtgtgatta cgcctaatct cccagaagca 420

gaattattat ccggtcagga aattaaaacc ttggaagaca tgaaaactgc agcgcagaaa 480 

ttgcatgatt taggagcgcc agcagtcatt atcaagggag gcaatcgtct tagtcaggac 540

aaggctgtgg atgtctttta tgatggacag acctttacta tcctagaaaa tccagttatc 600 

caaggccaaa atgctggtgc aggttgtacc tttgcctcta gcattgccag tcacctggtt 660 

aaaggtgata aatttttgcc agcagtagaa agctctaagg ctttcgttta tcgtgctatt 720

gcacaagcag atcagtatgg agtaagacaa tatgaagcaa acaaaaacaa ctaa 774

<210> 37 

<211> 1239

<212> DNA

<213> Streptococcus pneumoniae

<400> 37 

atgattgaaa cggagaaaaa agaggagcga gtcctgctga ttggtgtgga attgcagggt 60 
atggacagtt ttgacctctc catggaagaa ttggctagtt tagcgaaaac ggcaggggca 120
gtcgttgtag atagctacag acaaaaacgt gaaaaatatg attccaagac cttcgtcggc 180
tctggtaagt tggaagagat tgcgcttatg gtggatgcag aagaaatcac tactgtcatc 240 
gtcaacaatc gtctgacccc aaggcagaat gtcaatctag aggaagttct cggtgttaag 300 
gtcattgacc gtatgcagtt gattttggat atctttgcca tgcgggctcg aagccatgaa 360
gggaagctcc aagtccacct agcccaattc aaatacctct tgcctcgctt ggttggtcag 420
gggattatgc tcagccgtca ggcaggggga attggttccc gtggtcctgg tgaaagccaa 480 
ctggagctga accgtcgtag cgttcgcaat caaatcacgg atatcgagcg ccagcttaag 540 
gtggttgaga aaaatcgtgc gactgtcaga gaaaaacgtt tggagtctag cacttttaag 600 
attggtttga ttggttatac taatgctggg aaatcaacta tcatgaacat cttgaccagt 660 
aagacccagt atgaagcaga tgagctcttt gcgactctgg atgcgacaac caagagtatt 720
catctgggag gcaatctcca agtaactttg acagataccg ttggctttat ccaagatttg 780 
ccgacagagt tggtgtccag tttcaagtca accttggaag aaagcaagca tgtggacctt 840 
ctggttcatg ttatcgatgc tagcaatcct taccacgagg agcatgaaaa aacggttctc 900
tccatcatga aagacctgga catggaagat attcctcact tgacgcttta taataaagcg 960
gatttggtgg aggatttcac gcctacccaa acgccatata ccctcatttc tgccaagtct 1020
gaggacagtc gtgaaaactt gcaagcatta ttgctagata agattaagga aatttttgaa 1080 
gcatttaccc tgcgagtgcc tttttcaaag tcctacaaga ttcatgattt agagagtgtt 1140 
gcaattctgg aagaacgtga ttatcaggaa gacggcgaag tgattacagg ctacatttcc 1200
gagaaaaata aatggaggtt agaagaattt tatgactga 1239 


<210> 38
<211> 483
<212> DNA

<213> Streptococcus pneumoniae

<400> 38
atggcagaaa aaacatatcc tatgaccctt gaggaaaagg aaaaacttga aaaagaatta 60
gaagaattga aattggttcg tcgaccagaa gtggtagaac gcattaagat tgcccgttca 120
tacggtgacc tttcagaaaa cagtgagtac gaagcagcta aggatgaaca agcctttgtc 180
gaaggacaaa tctctagctt agaaacaaaa atccgctatg ctgaaatcgt caatagcgac 240
gcagttgccc aggacgaagt agcgattggt aaaacagtca ccatccaaga aattggtgag 300
gacgaagaag aagtttatat tatcgtaggt tcagctggtg cggatgcctt tgcaggtaag 360
gtttcaaatg aaagcccaat tgggcaggcc ttgattggca agaaaacagg tgatacagca 420
accattgaaa cgcctgttgg tagctatgat gtaaaaatct tgaaggttga aaaaacagcc 480
taa 483



<210> 39

<211> 570

<212> DNA

<213> Streptococcus pneumoniae

<400> 39

atgaccaaat tacttgtagg cttgggaaat ccaggggata aatattttga aacaaaacac 60
aatgttggtt ttatgttgat tgatcaacta gcgaagaaac agaatgtcac ttttacacac 120
gataagatat ttcaagctga cctagcatcc tttttcctaa atggagaaaa aatttatctg 180
gttaaaccaa cgacctttat gaatgaaagt ggaaaagcag ttcatgcttt attaacttac 240
tatggtttgg atattgacga tttacttatc atttacgatg atcttgacat ggaagttggg 300
aaaattcgtt taagagcaaa aggctcagca ggtggtcata atggtatcaa gtctattatt 360
caacatatag gaactcaggt ctttaaccgt gttaagattg gaattggaag acctaaaaat 420
ggtatgtcag ttgttcatca tgttttgagt aagtttgaca gggatgagta tatcggtatt 480
ttacagtctg ttgacaaagt tgacgattct gtaaactact atttacaaga gaaaaatttt 540
gagaaaacaa tgcagaggta taacggataa 570



<210> 40
<211> 852
<212> DNA
<213> Streptococcus pneumoniae

<400> 40

atgattttaa ttacaggggc aaatggccaa ttaggaacgg aacttcgcta tttattggat 60
gaacgtaatg aagaatacgt ggcagtagat gtggctaaga tggacattac caatgaagaa 120
atggttgaga aagtttttga agaggtgaaa ccgactttag tctaccattg tgcagcctac 180
accgctgttg atgcagcaga ggatgaagga aaagagttgg acttcgccat caatgtgacg 240
gggacaaaaa atgtcgcaaa agcatctgaa aagcatggtg caactctagt ttatatttct 300
acggactatg tctttgacgg taagaaacca gttggacaag agtgggaagt tgatgaccga 360
ccagatccac agacagaata tggacgcact aagcgtatgg gggaagagtt agttgagaag 420
catgtgtcta atttctatat tatccgtact gcctgggtat ttggaaatta tggcaaaaac 480
ttcgttttta ccatgcaaaa tcttgcgaaa actcataaga ctttaacagt tgtaaatgat 540
cagtacggtc gtccgacttg gactcgtacc ttggctgagt tcatgaccta cctagctgaa 600
aatcgtaagg aatttggtta ttatcatttg tcaaatgatg cgacagaaga cacaacatgg 660
tatgattttg cagttgaaat tttgaaagat acagatgtcg aagtcaagcc agtagattcc 720
agtcaatttc cagccaaagc taaacgtccg ctaaactcaa cgatgagcct ggccaaagcc 780
aaagctactg gatttgttat tccaacttgg caagatgcat tgcaagaatt ttacaaacaa 840
gaagtgagat aa 852

<210> 41
<211> 1224
<212> DNA

<213> Streptococcus pneumoniae

<400> 41

atgaaacgtt ctctcgactc aagagtcgat tacagtttgc tcttgccagt attttttcta 60 
ctggtcatcg gtgtggtggc tatctatata gccgttagtc atgattaccc caataatatt 120 
ctgcccattt tagggcagca ggtcgcctgg attgccttgg ggcttgtgat tggttttgtg 180
gtcatgctct ttaatacaga atttctttgg aaggtgaccc cctttctata tattttaggc 240 
ttgggactta tgatcttgcc gattgtattt tataatccaa gcttagttgc atcaacgggt 300 
gccaaaaact gggtatcaat aaatggaatt accctattcc aaccgtcaga atttatgaag 360 
atatcctata tcctcatgtt ggctcgtgtc attgtccaat ttacaaagaa acataaggaa 420 

tggagacgca cggttccgct ggactttttg ttaattttct ggatgattct ctttaccatt 480
ccagtcctag ttcttttagc acttcaaagt gacttgggga cggctttggt ttttgtagcc 540 
attttctcag gaatcgtttt attatcaggg gtttcttgga aaattattat cccagtattt 600 
gtgactgctg taacaggagt tgctggtttc ttagctatct ttattagcaa ggacggacga 660
gcttttcttc accagattgg aatgccgacc taccaaatca atcggatttt ggcttggctc 720

aatccctttg agtttgccca aacaacgact taccagcagg ctcaagggca gattgccatt 780
gggagtggtg gcttatttgg tcagggattt aatgcttcga atctgcttat cccagttcga 840 
gagtcagata tgatttttac ggttattgca gaagattttg gctttattgg ctctgtcctg 900 
gttattgccc tctatctcat gttgatttac cgtatgttga agattactct taaatcaaat 960 
aaccagttct acacttatat ttccacaggt ttgattatga tgttgctctt ccacatcttt 1020 

gagaatatcg gtgctgtgac tggactactt cctttgacgg ggattccctt gcctttcatt 1080
tcgcaagggg gatcagctat tatcagtaat ctgattggtg ttggtttgct tttatcgatg 1140 
agttaccaga ctaatctagc tgaagaaaag agcggaaaag ttccattcaa acggaaaaag 1200 
gttgtattaa aacaaattaa ataa 1224 

<210> 42
<211> 609 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 42

atgggaaaaa tcatcggaat cactggggga attgcctcag gtaagtcaac tgtgacaaat 60

tttctaaaac accaagggct ttcaagcagt ggattgccga cgcagtgttc caccaactac 120 

agaaaacctg gtggtcgtct gtttgaggct ttagtacagc actttgggca agaaatcatt 180 

cttgaaaacg gagaactcaa tcgccctctc atagctagtc tcatcttttc aaatcctgaa 240 

gagcaaaaat ggtctaatca aattcaaggg gagattatcc gtgaggaact ggctactttg 300

agagaacagt tggctcagac agaagagatt ttcttcatgg atattcccct actttttgaa 360 

caggactaca gcgattggtt tgctgagact tggttggtct atgtggaccg agatgcccaa 420 

gtagaacgct taatgaaaag ggaccagttg tccaaagatg aagctgagtc tcgtatggca 480 

gcccagtggc ctttagaaaa aaagaaagat ttggccagcc aggttcttga taataatggc 540

aatcagaacc agcttcttaa tcaagtgcat atccttcttg agggaggtag gcaagatgac 600
agagattaa 609 

<210> 43 
<211> 1260 
<212> DNA

<213> Streptococcus pneumoniae 

<400> 43

atgagaaaaa ttgttatcaa tggtggatta ccactgcaag gtgaaattac tattagtggt 60 

gctaaaaata gtgttgtggc cttaattcca gctattatat tggctgatga tgtggtgact 120

ttggattgtg ttccagatat ttcggatgta gccagtcttg tcgaaatcat ggaattgatg 180 

ggagctactg ttaagcgtta tgacgatgtc ttggagattg atccaagagg tgttcaaaat 240 

attccaatgc cttatggtaa aattaacagt cttcgtgcat cttactattt ttatgggagc 300 

ctcttaggcc gttttggtga agcgacagtt ggtctaccgg gaggatgtga tcttggtcct 360 

cgtccgattg acttacacct taaggcgttt gaagctatgg gtgccactgc tagctacgag 420

ggagataaca tgaagttatc tgctaaagat acaggacttc atggtgcaag tatttacatg 480 

gatacggtta gtgtgggagc aacgattaat acgatgattg ctgcagttaa agcaaatggt 540
cgtactatta ttgaaaatgc agcccgtgaa cctgagatta ttgatgtagc tactctcttg 600
aataatatgg gtgcccatat ccgtggggca ggaactaata tcatcattat tgatggtgtt 660 
gaaagattac atgggacacg tcatcaggtg attccagacc gcattgaagc tggaacatat 720 
atatctttag ctgctgcagt tggtaaagga attcgtataa ataatgttct ttacgaacac 780 
ctggaagggt ttattgctaa gttggaagaa atgggagtga gaatgactgt atctgaagac 840
agcatttttg tcgaggaaca gtctaatttg aaagcaatca atattaagac agctccttac 900 
ccaggctttg caactgattt gcaacaaccg cttacccctc ttttactaag agcgaatggt 960 
cgtggtacaa ttgtcgatac gatttacgaa aaacgtgtaa atcatgtttt tgaactagca 1020 
aagatggatg cggatatttc gacaacaaat ggtcatattt tgtacacggg tggacgtgat 1080 
ttacgtgggg ccagtgttaa agcgaccgac ttaagagctg gggctgcact agtcattgct 1140
gggcttatgg ctgaaggtaa aactgaaatt accaatattg agtttatctt acgtggttat 1200 
tctgatatta tcgaaaaatt acgtaattta ggagcggata ttagacttgt tgaggattaa 1260 

<210> 44 
<211> 696
<212> DNA

<213> Streptococcus pneumoniae 
<400> 44 

atgtcaagaa ttgaattttc accatctttg atgaccatgg atttggacaa attcaaagag 60
cagattactt ttttgaatga taaagtagca tcttatcata tcgatattat ggatggccat 120 
tttgttccca atattacctt gtctccttgg ttcattcaag aagttcaaaa aattagtgac 180 
acacctttat cagttcatct gatggtcaca gacccaacct tttgggtaga tcaagttctc 240
gatttacaat gtgagtatat ttgtattcat gctgaagttc tgaatggtct tgcttttcgt 300 

ttgattgata aaattcatga tgcaggtcta aaggctggtg ttgtccttaa tcctgaaaca 360
cctgtttcta caatctttcc ctacattgat ttacttgaca aagtaactat tatgactgta 420 
gatccaggtt ttgcaggaca acgctttttg gagtctacct tgtataaaat ccaagaactc 480 
cgtcagctta gagttcagaa tggttatcac tacatcattg agatggatgg ttcttcgagt 540 
cgtaagactt tcaaacaaat tgatgtggca ggaccagata tttatgttat aggtcgcagt 600 

ggattatttg gtttggatga cgatattgcc aaagcctggg atatctgttc tagagattac 660
gaagaaatga ccggaaaaac aatgccaatc aaataa 696 

<210> 45 
<211> 1125
<212> DNA

<213> Streptococcus pneumoniae 

<400> 45 

atgagaaata tggctttgac agcaggtatc gttggtttgc caaacgttgg taaatcaaca 60 

ctatttaatg caattacaaa agcaggagca gaggcagcaa actacccatt tgcgactatt 120
gatccaaatg ttggaatggt ggaagatcca gatgaacgcc tacaaaaact aactgaaatg 180 
ataactccta aaaagacagt tcccacaaca tttgaattta cggatattgc agggattgta 240 
aaaggagctt caaaaggaga agggctaggg aataaattct tggccaatat tcgtgaagta 300 
gatgcgattg ttcacgtagt tcgtgctttt gatgatgaaa atgtgatgcg cgagcaagga 360 

cgtgaagacg cctttgtaga tccacttgca gatattgata caattaatct ggaattaatt 420
cttgctgact tagaatcagt gaacaaacga tatgcgcgtg tagaaaagat ggcacgtacg 480 
caaaaagata aagaatcagt agcagaattc aatgttcttc aaaagattaa accagtccta 540 
gaagacggga aatcagctcg taccattgaa tttacagatg aggaacaaaa ggttgtcaaa 600 
ggtcttttcc ttttgacgac taaaccagtt ctttatgtag ctaatgtgga cgaggatgtg 660 

gtttcagaac ctgactctat cgactatgtc aaacaaattc gtgaatttgc agcgacagaa 720
aatgctgaag tagtcgttat ttctgcgcgt gctgaggaag aaatttctga attggatgat 780 
gaagataaaa aagagtttct tgaagccatt ggtttgacag aatcaggtgt agataagttg 840 
acgcgtgcag cttaccactt gcttggattg ggaacttact tcacagctgg tgaaaaagaa 900 
gttcgcgctt ggactttcaa acgtggtatg aaggctcctc aagcagctgg tattatccac 960 

tcagactttg aaaaaggctt tattcgtgca gtaaccatgt catatgaaga tctagtgaaa 1020
tacggatctg aaaaggccgt aaaagaagct ggacgcttgc gtgaagaagg aaaagaatat 1080 
atcgttcaag atggcgatat catggaattc cgctttaatg tctaa 1125 



<210> 46
<211> 333
<212> DNA

<213> Streptococcus pneumoniae

<400> 46
atggaaatcg aaaaaaccaa tcgtatgaat gcgctctttg aattttatgc ggcgcttttg 60
acagataagc aaatgaatta tatagagctt tactacgctg atgattacag tcttgctgag 120
atagctgagg agtttggtgt tagtcgtcag gctgtctatg acaatatcaa gcgaacagaa 180
aagattctgg aagattatga gatgaaattg cacatgtact cggactacat tgtccgtagt 240
cagatttttg accaaatctt ggagcgctat cccaaggatg attttctgca ggagcagata 300
gaaattttaa caagcattga taatagagaa taa 333



<210> 47
<211> 672
<212> DNA

<213> Streptococcus pneumoniae

<400> 47

atgaccttag aatgggaaga atttctagat ccttacattc aagctgttgg tgagttaaag 60
attaaacttc gtggtattcg taagcaatat cgtaagcaaa ataagcattc tccaattgag 120
tttgtgaccg gtcgagtcaa gccaattgag agcatcaaag aaaaaatggc tcgtcgtggc 180
attacttatg cgaccttgga acacgatttg caggatattg ctggcttacg tgtgatggtt 240
cagtttgtag atgacgtcaa ggaagtagtg gatattttgc acaagcgtca ggatatgcga 300
atcatacagg agcgagatta cattactcat agaaaagcat caggctatcg ttcctatcat 360
gtggtagtag aatatacggt tgataccatc aatggagcta agactatttt ggcagaaatt 420
caaattcgta ctttggccat gaatttctgg gcaacgatag aacattctct caactacaag 480
taccaagggg atttcccaga tgagattaag aagcgactgg aaattacagc tagaatcgcc 540
catcagttgg atgaagaaat gggtgaaatt cgtgatgata tccaagaagc ccaggcactt 600
tttgatcctt tgagtagaaa attaaatgac ggtgtaggaa acagtgacga tacagatgaa 660
gaatacaggt aa 672



<210> 48
<211> 588
<212> DNA

<213> Streptococcus pneumoniae

<400> 48

atggaactta atacacacaa tgctgaaatc ttgctcagtg cagctaataa gtcccactat 60
ccgcaggatg aactgccaga gattgcccta gcagggcgtt caaatgttgg taaatccagc 120
tttatcaaca ctatgttgaa ccgtaagaat ctcgctcgta catcaggaaa acctggtaaa 180
acccagctcc tgaacttttt taacattgat gacaagatgc gctttgtgga tgtgcctggt 240
tatggctatg ctcgtgtttc taaaaaggaa cgtgaaaagt gggggtgcat gattgaggag 300
tacttaacga ctcgggaaaa tctccgtgcg gttgtcagtc tagttgacct tcgtcatgac 360
ccgtcagcag atgatgtgca gatgtacgaa tttctcaagt attatgagat tccagtcatc 420
attgtggcga ccaaggcgga caagattcct cgtggtaaat ggaacaagca tgaatcagca 480
atcaaaaaga aattaaactt tgacccaagt gacgatttca tcctcttttc atctgtcagc 540
aaggcaggga tggatgaggc ttgggatgca atcttagaaa aattgtga 588

<210> 49
<211> 294
<212> DNA

<213> Streptococcus pneumoniae
<400> 49

atgaaaacaa gaaaaatccc tttgcgcaag tctgttgtgt ctaacgaagt gattgataag 60
cgtgatttgc tccgcattgt taagaacaag gaaggacaag tctttattga tcctacgggc 120
aaggccaatg gccgcggcgc ttatatcaaa ctagacaatg cagaagccct agaggcgaaa 180
aagaagaagg tctttaaccg cagctttagc atggaagtgg aagaaagctt ttatgacgag 240
ttgatcgctt atgtggatca caaagtgaaa agaagagagt tgggacttga ataa 294

<210> 50
<211> 312
<212> DNA

<213> Streptococcus pneumoniae

<400> 50

atgttaaaac cctctattga taccttgctc gacaaggttc cttcaaaata ttcactcgta 60
atcttggaag caaaacgtgc ccacgaattg gaagcaggtg ccccagcaac tcaaggtttc 120
aagtctgaaa aatcaactct tcgcgcttta gaagaaatcg aatcaggaaa cgttacaatt 180
cacccagatc cagaaggaaa acgtgaagca gtgcgtcgcc gtatcgaaga agaaaaacgc 240
cgcaaagaag aagaagaaaa gaaaatcaaa gagcaaattg ctaaagaaaa agaagatggt 300
gaaaaaattt aa 312

<210> 51
<211> 312
<212> DNA

<213> Streptococcus pneumoniae
<400> 51

atgtcattaa catcaaaaca acgtgccttc ctcaacagcc aggcacacac cctcaaacct 60
atcatccaaa tcgggaaaaa tggactcaac gaccaaatca aaaccagcgt ccgtcaagct 120
cttgatgcgc gtgaattaat caaggttact ctcttacaaa acacagatga aaacatccac 180
gaagtagctg aaattttgga agaagaaatc ggtgtggata cagtccaaaa aataggacgc 240
atcttgattt tgtttaaaca atctagcaag aaagaaaatc gcaagatttc taagaaagtc 300
aaagaaatct aa 312

<210> 52
<211> 528
<212> DNA
<213> Streptococcus pneumoniae

<400> 52

atggcgattg aaaattatat accagatttt gctgtggaag cagtctatga tctgacagtc 60
ccaagcctgc aggcgcaggg aatcaaggct gttttggtcg atttggataa taccctcatt 120
gcttggaaca accctgatgg aacgccagag atgaagcaat ggctacatga ccttcgggac 180
gcgggtattg gcattatcgt agtgtcaaat aacaccaaaa aacgcgttca acgagcagtt 240
gagaaatttg ggattgatta cgtttactgg gccttgaagc ccttcacatt tggtattgac 300
cgtgctatga aggaattcca ctatgacaaa aaggaagtgg tcatggttgg tgaccagctc 360
atgacagata tacgagcagc ccaccgtgca gggattcggt caattttagt caaacccttg 420
gtccaacatg actcaatcaa aacgcagatt aaccgaactc gtgagcgtcg tgttatgaga 480
aaaatcactg aaaagtacgg accgattaca tataaaaaag gaatttaa 528

<210> 53
<211> 1368
<212> DNA

<213> Streptococcus pneumoniae

<400> 53

atgtttcgaa aaattttaat tgccaatcgt ggtgaaattg cggttcgtat tatccgtgcg 60
gcacgtgaat tggggattgc gacggtagcg gtttattcaa ctgctgataa ggaagctctt 120
catacgctgt tggcagatga agcagtttgt attggtcctg gcaaggcaac agagtcttat 180
ctcaatatta atgcagttct atcagctgca gtcttgactg aggcagaagc tattcaccct 240
ggttttggat ttctcagtga aaattccaaa tttgcgacca tgtgtgaaga aataggtatc 300
aagtttatcg gtccatctgg tcatgttatg gatatgatgg gggataaaat caatgcgcgt 360
gctcagatga ttaaagcagg tgtgcctgtt ataccaggtt cagatggaga agtgcataac 420
tctgaagaag ctttgattgt tgctgaaaaa attggctatc ctgttatgct caaggcttca 480
gcaggtggag gtggtaaagg gattcgtaag gttgaaaaac cagatgacct cgtttctgcc 540
tttgaaactg cctctagtga ggccaaggcc aattatggca atggtgccat gtacatagaa 600
cgggttatct atccagctcg gcacattgag gttcaaatcc taggtgatga gcatggacat 660
gtgattcact tgggtgaacg ggattgttct cttcaaagga ataaccaaaa ggttttggaa 720
gaaagtccct cgattgcaat cggaaaaacg ctgcgtcatg aaataggtgc tgctgctgtt 780
cgagcggcag agtttgttgg ctatgagaat gcaggaacca ttgaatttct tcttgatgaa 840
gcaagtagca atttctattt catggagatg aatactcgtg ttcaggtaga acatccagta 900
acagagtttg tttcaggtgt tgatatcgtt aaggaacaga tttgcattgc ggcaggtcag 960
cctttgtctg ttaagcaaga agatattgtc ctacgcggtc atgccatcga gtgtcgtatc 1020
aatgcagaaa acccagcctt taactttgct ccaagtccag gtaagattac taatctctat 1080
ctgccaagtg gtggagttgg cttgcgcgtg gattcagcag tttatccagg ttataccatt 1140
ccgccttatt atgatagtat gattgccaaa atcatagtac acggcgaaaa tcgttttgac 1200
gccttgatga aaatgcaacg tgccctctat gaattagaaa ttgaaggagt gcagaccaat 1260
gcagatttcc agcttgacct catttcagat cgcaatgtca ttgctgggga ttatgatact 1320
tgcttcttga tggaaacctt cttacctaaa tatcaagaaa aagaataa 1368

<210> 54
<211> 234
<212> DNA

<213> Streptococcus pneumoniae

<400> 54

atgatttaca aagtttttta tcaagaaaca aaagaacgta gcccacgccg tgaaacaaca 60
cgcgcgcttt acctagacat cgataccagc tcagaacttg agggccgtat cactgctcgc 120
caacttgtcg aagaaaatcg cccagagtac aatatcgaat atatcgaact cttgtctgac 180
aaattgctcg attacgaaaa agaaactggc gccttcgaaa ttacggagtt ctaa 234

<210> 55

<211> 1011

<212> DNA

<213> Streptococcus pneumoniae

<400> 55

atgaaggata gatatatttt agcatttgag acatcctgtg atgagaccag tgtcgccgtc 60
ttgaaaaacg acgatgagct cttgtccaat gtcattgcta gtcaaattga gagtcacaaa 120
cgttttggtg gcgtagtgcc cgaagtagcc agtcgtcacc atgtcgaggt cattacagcc 180
tgtatcgagg aggcattggc agaagcaggg attaccgaag aggacgtgac agctgttgcg 240
gttacctacg gaccaggctt ggtcggagcc ttgctagttg gtttgtcagc tgccaaggcc 300
tttgcttggg ctcacggact tccactgatt cctgttaatc acatggctgg gcacctcatg 360
gcagctcaga gtgtggagcc tttggagttt cccttgctag ccctcttggt cagcggcgga 420
cacacagagt tggtctatgt ttctgaggct ggcgattaca agattgttgg ggagacacga 480
gatgacgcgg ttggcgaggc ctatgataag gtcggccgtg tcatgggctt gacctatcct 540
gcaggtcgtg agattgacga gctggctcat caggggcagg atatttatga tttcccccgt 600
gccatgatta aggaagataa tctggagttt tcattctctg gtttgaagtc agcctttatc 660
aatcttcatc acaatgccga gcaaaaggga gaaagcctgt ctacagaaga tttgtgtgct 720
tccttccaag cagcagttat ggacattctc atggcaaaaa ccaagaaggc tttggaggaa 780
tatcctgtta aaaccctatt tgtggcaggt ggtgtggcag ccaataaagg tctcagagaa 840
cgcttagcag ccgaaatcac agatgtcaag gttatcatcc cccctctgcg actctgcgga 900
gacaatgcag gtatgattgc ctatgccagc gtcagcgagt ggaacaaaga aaacttcgca 960
ggctgggacc tcaatgccaa accaagtctt gcctttgata ccatggaata a 1011

<210> 56
<211> 1809
<212> DNA

<213> Streptococcus pneumoniae
<400> 56

atgtgtggaa ttgttggtgt tgttggaaac acaaatgcaa ctgatatttt gattcaaggg 60
cttgaaaagc ttgaataccg tggctatgat tctgcgggaa tttttgtcct agatggtgct 120 
gataaccatt tggtgaaggc ggttggtcgt attgcagaat tgtctgccaa gacagctggt 180 
gttgagggaa caactggtat cggacatact cgttgggcta ctcacggaaa accaactgag 240
gacaatgctc acccacaccg ctctgagaca gaacgttttg tcttggtgca taatggggtg 300 

attgagaact atcttgaaat caaggaagaa taccttgcag gtcaccactt caaggggcag 360
acagatactg aaatagccgt tcatttgatt ggaaaatttg cggaagaaga agggctctca 420 
gttcttgaag cctttaaaaa agctcttcat attatccgtg gttcatatgc ctttgccttg 480 
attgactctg aaaatccaga tgtcatctat gtagcgaaaa acaaatctcc acttttgatt 540 
ggtcttgggg aaggctacaa tatggtctgc tcagatgcta tggctatgat tcgtgaaacc 600 

aaccaataca tggaaattca tgaccaagag ttggtaatcg tcaaggctga tagcgtggaa 660
gttcaagact atgatggtaa cagtcgtgaa cgtgctagct atactgcgga acttgacttg 720 
tcagatatcg gtaagggaac ttatccttac tacatgctta aggaaattga tgagcaacca 780 
actgttatgc gtaaactcat tcaagcctac acggatgatg ctggtcaagt agtggttgct 840 
cctgctatca ttaaggctgt tcaagacgca gaccgcatct acatccttgc agctggaaca 900 

tcttaccatg caggatttgc ttctaagaaa atgttggaag aattgacaga tacaccagtt 960
gaacttggaa tctcatctga gtggggctac ggtatgccac ttctcagcaa gaaaccactc 1020 
ttcatcttta tcagccaatc tggtgaaaca gcggatagtc gtcaagtttt ggtcaaggct 1080 
aatgaaatgg gaattccaag cttaacagtg acaaatgttc caggttcaac cctctcacgt 1140 
gaagccaact ataccatgct ccttcacgca ggacctgaaa ttgccgtggc atcaactaaa 1200 

gcctatacag cgcaaatcgc agcccttgcc ttccttgcaa aagcagtcgg agaagcaaat 1260
ggtaatgcta aagcgcaagc ctttgacctg gttcatgaat tgtcaatcgt agctcagtct 1320 
attgaatcaa ctctttcaga gaaagaaacc attgaagcca aggttcgtga acttcttgaa 1380 
acaactcgta acgcctttta catcggacgt ggtcaagatt actacgtagc catggaagca 1440 
agtctcaaac tcaaagagat ttcttatatc cagtgtgaag gttttgcggc aggagaactc 1500 

aagcacggaa ccattgcctt gattgaagaa ggaacgcctg ttttggctct cttgtcagat 1560
ccagttcttg ccaaccatac tcgtggaaat atccaagagg tcgcagcccg tggtgccaaa 1620 
gtcctcacta tcgcagaaga gaatgtagcc aaagataccg acgatatcgt ccttacgacc 1680 
gtacatccat acctctcacc aatttcaatg gtcgtaccaa cgcaattagt cgcttacttt 1740 
gcaaccctcc accgtggcct cgatgtggac aaaccacgta accttgccaa gtcagtaacg 1800 

gtagaataa 1809

<210> 57 

<211> 723 

<212> DNA 

<213> Streptococcus pneumoniae

<400> 57 

atgatacgta tcgaaaatct cagtgtctcc tacaaagaaa cgttggcact taaggatatt 60 
tcactagtgc tccatggacc aacaattacc ggcatcattg gtccaaacgg cgctgggaaa 120 

tcaacactat taaaaggtat gctgggaatt atcccacatc aaggtcaggc atttctcgat 180
gacaaggaag ttaaaaaatc cttacaccga attgcctatg tcgaacaaaa aatcaatatc 240 
gactacaact ttcccatcaa ggtcaaggaa tgcgtctcgt taggactatt tccctctatt 300 
cctctctttc gaagtttaaa ggctaaacat tggaagaaag tgcaagaggc ccttgaaatc 360 
gtcggcctag ctgactacgc tgaacgtcaa attagtcaac tgtctggagg tcaattccag 420 

cgggtcttga ttgccagatg tttggtgcag gaagccgact atatcctctt ggatgaaccc 480
tttgctggga ttgactctgt cagtgaggaa atcatcatga atacgctgag agatttgaaa 540 
aaagctggga agacggttct catcgttcac cacgacctca gcaagattcc ccactacttc 600 
gatcaagtct tacttgtcaa tcgagaagtg attgcctttg gtccaacaaa agaaactttt 660 
accgaaacca atctaaaaga agcttacggt aatcaactct ttttcaatgg aggtgaccta 720 

tga 723

<210> 58 
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<211> 2223 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 58

atgccgaaag aagtgaattt aacaggcgaa gaagttgtcg ctttaaccaa agaatattta 60
acggaagagg atgttcattt tgtccataag gccttggtct atgctgttga atgccacagt 120
ggtcaatatc gcaaatcagg cgagccttat atcattcacc ctatccaagt ggcaggtatt 180
ttagctaagc taaagctgga tgctgtaaca gtagcttgtg gattcttgca tgatgtggtg 240
gaagatacag atgcgacctt ggacgatttg gaaagagagt ttggtcctga tgtgcgggtg 300
attgttgacg gagttaccaa gcttggcaag gtcgagtaca aatcgatcga ggagcaatta 360
gcggaaaatc atcgcaagat gctcatggcc atgtctgagg acatccgcgt tattttggtc 420
aaactgtctg accgcttgca caatatgcgg accctgaaac atcttcgaaa agacaagcag 480
gagcgtattt ccaaagaaac catggaaatc tatgccccac ttgcccatcg tttggggatt 540
tccagtgtca aatgggaatt agaagacttg tctttccgtt atctcaatcc aacggagttt 600
tacaagatta cccatatgat gaaggaaaag cgcagggagc gtgaggcctt ggtggatgag 660
gtagtcacaa aattagagga gtatacgaca gaacgtcact tgaaagggaa gatttatggt 720
cgtcccaagc atatttactc aattttccgc aaaatgcagg acaagagaaa acggtttgag 780
gaaatctatg atctgattgc tattcgttgt attttagata cccaaagtga tgtttatgcc 840
atgcttggtt acgtgcatga attttggaaa ccgatgccag gtcgcttcaa agactatatt 900
gccaaccgca aggccaatgg ttatcagtct atccatacga ctgtttatgg accaaaaggg 960
ccgattgaat tccagattcg aaccaaggaa atgcacgagg tggctgagta cggggttgcg 1020
gctcactggg cttataagaa aggtataaag gggcaagtta acagcaagga atcagctatt 1080
ggaatgaact ggatcaagga gatgatggag ctccaagacc aggctgatga tgctaaggaa 1140
tttgtggact ctgttaagga aaactatctg gctgaggaga tttacgtttt taccccagat 1200
ggagctgtcc gttcccttcc caaagattca ggaccgattg attttgccta cgaaatccat 1260
accaaggtcg gtgaaaaagc aactggtgcc aaggtcaatg gccgcatggt tccactgaca 1320
accaagttaa agacagggga tcaggttgaa attatcgcca acccgaactc ctttggacct 1380
agccgtgact ggctcaatat ggtcaagact agcaaggcgc gcaataagat tcgccagttc 1440
tttaaaaacc aagataagga attgtctgtc aacaagggtc gtgagatgct gatggctcag 1500
ttccaagaaa atggctatgt ggcaaataaa tttatggaca agcgccacat ggatcaagtt 1560
ctgcaaaaga ccagttacaa gacagaagac tccctctttg cggccattgg ttttggggaa 1620
atcggtgcga ttaccgtctt taaccgtctg actgaaaagg aacgccgtga ggaagagcgt 1680
gccaaggcca aggctgaggc agaggagctt gtcaaaggtg gcgaggtcaa ggttgaaaat 1740
aaagaaactc tcaaggtcaa gcatgagggg ggagtggtta ttgaaggtgc ttctggtctc 1800
ctagtgcgga ttgctaagtg ttgtaacccc gtgcctggtg acgatattgt tggctacatt 1860
accaagggtc gtggtgtggc tattcaccgt gtggactgta tgaacctgcg tgcccaagaa 1920
aactacgagc aacgtctcct tgatgtggaa tgggaagacc agtactctag ctcaaataag 1980
gagtatctgg cccatatcga tatctacggt ctcaaccgta caggactgtt gaacgatgta 2040
ctgcaagttc tttcaaatac aaccaagaat atttcaacgg tcaatgccca accaaccaag 2100
gatatgaagt ttgctaatat ccatgtgtcc ttcggtattg ccaacctctc tacactgacc 2160
acggttgtcg ataaaattaa gagtgtgcca gaagtttact ctgtcaaacg gaccaacggc 2220
tag 2223

<210> 59
<211> 1479
<212> DNA

<213> Streptococcus pneumoniae

<400> 59

atgtctaatt gggacactaa atttttgaaa aaaggtttta cctttgatga tgtattgctt 60
attccagctg aaagtcatgt gttgcctaac gatgcagatt taacaactaa attggcagat 120
aatttgactt taaatatccc aattattacc gctgccatgg acacagttac agagagtcaa 180
atggccattg ctattgctcg tgcaggcggt ctcggagtta tccataaaaa catgtcaatt 240
gctcaacaag cagacgaggt tcgtaaggta aaacgttctg aaaatggagt tattattgat 300
ccgttcttct tgacgcctga acatacaatt gctgaagcag atgagcttat gggtcgttac 360
cgcatcagtg gtgttccagt tgttgaaaca cttgaaaatc gtaaattggt tggtattttg 420
acaaaccgag atcttcgttt tatttcagat tataatcaac caatttcaaa ccatatgact 480
agtgaaaatc ttgttactgc tcctgtgggt acggatcttg caacggctga gagtattctt 540
caagagcatc gtattgaaaa acttccgttg gtcgatgaag aaggcagtct ttctggtttg 600
atcactatca aagatattga aaaagttatt gagtttccaa atgcggctaa agatgagttt 660
ggtcgtcttc tagttgcagg tgcagtaggt gttacttcag atacatttga acgtgcagag 720
gctctttttg aggcaggagc ggatgcgatt gttattgata ctgcacatgg tcattctgca 780
ggtgtcttgc gtaaaattgc cgagattcgt gctcatttcc cagatcggac tttgattgct 840
ggaaatattg ctactgctga aggtgcacgt gccctttatg aagcgggtgt agacgttgtt 900
aaggttggta ttggaccagg ttctatctgt actactcgtg tgattgctgg tgttggtgtt 960
ccgcaagtaa cagctatcta cgatgctgca gctgttgcgc gcgaatatgg taaaacgatt 1020
attgctgacg gtgggatcaa gtattctgga gatattgtaa aagcacttgc tgcaggtgga 1080
aatgctgtta tgcttggatc tatgtttgct ggaactgatg aagctccagg cgaaactgaa 1140
atcttccaag gacgtaaatt caagacttac cgtggtatgg gatcaattgc tgctatgaag 1200
aaaggttcaa gcgaccgtta tttccaaggt tctgtcaatg aagcaaacaa gcttgttcca 1260
gaaggaattg aaggtcgtgt tgcttataaa ggagcggcag ctgatattgt tttccaaatg 1320
attggtggta ttcgctctgg tatgggttac tgtggtgcag ctaaccttaa agaactacac 1380
gataatgctc aatttattga aatgtctggt gctggtttga aagaaagcca tcctcatgat 1440
gtgcaaatta ctaatgaggc accaaattat tctatgtaa 1479

<210> 60

<211> 1947
<212> DNA

<213> Streptococcus pneumoniae

<400> 60

atgacagaag aaatcaaaaa tctgcaggca caggattatg atgccagtca aattcaagtt 60
ttagagggct tagaggctgt tcgtatgcgt ccagggatgt acattggatc aacctcaaaa 120
gaaggtcttc accatctagt ctgggaaatt gttgataact caattgacga ggccttggca 180
ggatttgcca gccatattca agtttttatt gagccagatg attcgattac tgttgtggat 240
gatgggcgtg gtatcccagt cgatattcag gaaaaaacag gccgtcctgc tgttgagacc 300
gtctttacag tccttcacgc tggaggaaag ttcggcggtg gtggatacaa ggtttcaggt 360
ggtcttcacg gggtggggtc gtcagtagtt aatgcccttt ccactcaatt agacgttcat 420
gttcacaaaa atggtaagat tcattaccaa gaataccgtc gtggtcatgt tgtcgcagat 480
cttgaaatag ttggagatac ggataaaaca ggaacaactg ttcacttcac accggaccca 540
aaaatcttca ctgaaacaac aatctttgat tttgataaat taaataaacg gattcaagag 600
ttggcctttc taaatcgcgg tcttcaaatt tcaattacag ataagcgcca aggtttggaa 660
caaaccaagc attatcatta tgaaggtggg attgctagtt acgttgaata tatcaacgag 720
aacaaggatg taatctttga tacaccaatc tatacagacg gtgagatgga tgatatcaca 780
gttgaggtag ccatgcagta cacaactggt taccatgaaa atgtcatgag tttcgccaat 840
aatattcata cacatgaagg tggaacgcat gaacaaggtt tccgtacagc cttgacacgt 900
gttatcaatg attatgctcg taaaaataag ttactgaaag acaatgaaga caacctaaca 960
ggggaagatg ttcgcgaagg cttaactgca gttatctcag ttaaacaccc aaatccacag 1020
tttgaaggac aaaccaagac caaattggga aatagcgaag tggtcaagat taccaatcgc 1080
ctcttcagtg aagccttctc cgatttcctc atggaaaatc cacagattgc caaacgtatc 1140
gtggaaaaag ggattttagc tgccaaggct cgtgtggctg ccaagcgtgc gcgtgaagtc 1200
acacgtaaaa aatctggttt ggaaatttcc aaccttccag ggaaactagc agactgttct 1260
tctaataacc ctgctgaaac agaactcttc atcgtcgaag gagactcagc tggtggatca 1320
gccaaatctg gtcgtaaccg tgagtttcag gctatccttc caattcgcgg taagattttg 1380
aacgttgaaa aagcaagtat ggataagatt ctagctaacg aagaaattcg tagtcttttc 1440
acagccatgg gaacaggatt tggcgcagaa tttgatgttt cgaaagcccg ttaccaaaaa 1500
ctcgttttga tgaccgatgc cgatgtcgat ggagcccaca ttcgtaccct tcttttaacc 1560
ttgatttatc gttatatgaa accaatccta gaagctggtt atgtttatat tgcccaacca 1620
ccaatctatg gtgtcaaggt tggaagcgag attaaagaat atatccagcc gggtgcagat 1680
caagaaatca aactccaaga agctttagcc cgttatagtg aaggtcgtac caaaccgact 1740
attcagcgtt ataaggggct aggtgaaatg gacgatcatc agctgtggga aacaaccatg 1800
gatcccgaac atcgcttgat ggctagagtt tctgtagatg atgctgcaga agcagataaa 1860
atctttgata tgttgatggg ggatcgagta gagcctcgtc gtgagtttat cgaagaaaat 1920
gctgtctata gtacacttga tgtctaa 1947

<210> 61 
<211> 267 
<212> DNA

<213> Streptococcus pneumoniae 

<400> 61 

atgggattta ctgaagaaac agtacgtttt aaattggacg attccaataa aaaagaaatt 60

agcgaaactt tgacagatgt ttatgcttcg ttgaacgata agggttacaa cccaattaac 120

caaatcgtag gttacgtatt gagtggagac cctgcctacg ttcctcgtta taataatgca 180 

cgaaatcaaa tccgtaagta tgagcgtgat gaaatcgttg aggaattggt tcgctactac 240 

ctcaaaggac aaggagtcga tctataa 267 

<210> 62
<211> 597 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 62

atggtcaact atccacataa agtttcatca caaaaaagac aaacatctct ttctcaaccc 60 

aaaaatttcg caaatcgagg aatgtctttt gaaaagatga tcaatgctac caacgactac 120 

tatttgtctc agggcttggc tgttatacat aagaaaccaa ctcctattca aatcgtacaa 180 

gtggactatc cacaacgaag tcgtgccaag attgttgaag cctattttcg acaagcttca 240 

acgacggact attctggcgt ttataatgga tattacatcg actttgaagt caaggaaaca 300 

aaacaaaaac gtgcgattcc gatgaaaaat tttcatccac atcagattca gcatatggaa 360 

caagtccttg cccaacaagg aatctgcttt gtccttcttc acttttcttc tcagcaagaa 420 

acctacttat tgccggcatt cgatttgatt cgcttctatc atcaagataa gggacaaaaa 480 

tcaatgccac ttgaatatat tcgagaatat ggatatgaaa tcaaggctgg tgccttccct 540

caaattcctt atctcaatgt tatcaaagaa catttattag gtggtaaaac aagatga 597 

<210> 63 

<211> 867 

<212> DNA 

<213> Streptococcus pneumoniae






<400> 63 

atggctctat ttagtaaaaa agataagtat attcgaatca atcccaatcg ttcggttagg 60 
gaaaaacctc aagctaagcc agaggttcca gatgaattat tttcccagtg tccaggctgt 120 

aagcatacca tctatcagaa ggatctggga agtgaacgta tctgtccgca ctgtagctat 180
acctttcgta tttctgccca agaacgcttg gctttgacga ttgatatggg aaccttcaaa 240
gaattgttta cagggattga aagcaaggat cccttgcatt tccctggtta ccaaaagaaa 300 
ctggcatcta tgcgtgaaaa aacaggtctg catgaagccg ttgtgacagg aactgctctt 360 
attaaaggtc agactgtggc tcttgggatt atggattcta actttatcat ggcttctatg 420 

ggtacggttg taggtgaaaa aatcactcgt ttgtttgagt atgcgactgt cgaaaaattg 480
ccagttgtcc tattcacagc ctctggtgga gcccgtatgc aggaaggaat catgagtctc 540
atgcagatgg ctaagatctc tgcggcggtt aaacgccatt caaatgctgg tctcttttac 600 
ctgaccattt tgacagatcc aacgactggt ggtgtgacag cttctttcgc tatggaaggc 660 
gatatcattc tggctgaacc acagagcttg gttggttttg ctggacgtcg tgtgattgaa 720 

aatacggttc gtgaaagctt gcctgaggat ttccaaaagg cagaattcct attagaacat 780
ggctttgtgg atgctattgt caaaagaaga gacttaccag atacgattgc tagcctagtc 840 
agattgcatg gagggagtcc tagatga 867 

<210> 64 

<211> 420

<212> DNA 

<213> Streptococcus pneumoniae 
<400> 64

atgagaatta tgggattgga cgtcggttca aaaacggtag gggtggcgat tagcgatccg 60
cttggtttta cagctcaagg gcttgaaatc atccagataa atgaagaaca aggccaattt 120
ggttctgacc gcgttaagga attggttgat acttacaagg tggaacgatt tgtagtgggc 180
ttgcctaaaa acatgaacaa tacaagtgga ccgcgcgtag aagctagtca agcatacgga 240
gcaaagctag aagagttttt tggtttacca gtagactatc aggatgaacg cttgacaaca 300
gtggctgctg agcgcatgtt gattgaacaa gcagatatca gtcgcaataa gcgcaagaaa 360
gtcattgata agttagcagc tcagctgatt ttacaaaatt atttagatag aaaattttaa 420

<210> 65
<211> 1197
<212> DNA

<213> Streptococcus pneumoniae
<400> 65

atggcaaaac ttactgttaa agacgttgac ttgaaaggta aaaaagtcct cgttcgtgtt 60
gacttcaacg taccattgaa agatggcgta atcactaacg ataaccgtat cacagcagct 120
cttccaacta ttaagtacat catcgaacaa ggtggacgtg caattctttt ctctcacctt 180
ggacgtgtga aagaagaagc tgataaagct ggtaaatcac ttgctcctgt agcagcagac 240
ttggcagcaa aacttggtca agatgttgtt ttcccaggtg tcactcgtgg tgctgaattg 300
gaagcggcaa tcaacgctct tgaagatgga caagttctct tggttgaaaa cactcgttac 360
gaagatgttg acggcaagaa agaatctaaa aacgatcctg aacttggtaa atactgggca 420
tcacttggag atggtatctt cgtaaacgat gcattcggta cagctcaccg tgcacacgca 480
tctaacgttg gtatctcagc aaacgttgaa aaagcagttg ctggtttcct tcttgaaaac 540
gaaattgcct acatccaaga agcagttgaa actccagaac gtccattcgt ggctatcctt 600
ggtggttcaa aagtttcaga caagatcggt gttatcgaaa acttgcttga aaaagctgat 660
aaagtcctta tcggtggtgg gatgacttac acattctaca aagcacaagg tatcgaaatc 720
ggtaactcac ttgtagaaga agacaaattg gatgttgcga aagctcttct tgaaaaagca 780
aatggtaaat tgatcttgcc agttgactca aaagaagcta acgcatttgc tggttacact 840
gaagtgcgtg acactgaagg tgaagcagtt tctgaaggct tccttggtct tgacatcggt 900
ccaaaatcta tcgccaaatt tgacgaagct ttgactggtg ccaaaacagt tgtatggaac 960
ggacctatgg gtgtatttga aaacccagat ttccaagctg gtacaatcgg tgtgatggac 1020
gctatcgtga aacaaccagg agttaaatca atcatcggtg gtggtgactc agctgccgca 1080
gcgattaacc ttggccgtgc agacaagttc tcatggatta gtacgggtgg tggagcatca 1140
atggaacttc ttgaaggtaa ggttcttcca caacttgcag ccttgacaga aaaataa 1197






<210> 66 
<211> 498 
<212> DNA 

<213> Streptococcus pneumoniae 






<400> 66 

atgttaaaat cagaaaaaca atcacgttat caaatgttaa atgaagaatt gtccttccta 60
ttggaaggcg aaaccaatgt tttggctaat ctttccaacg ccagtgctct cataaaatca 120
cgttttccta ataccgtatt tgcaggcttt tatttgttcg atggaaagga attggtttta 180
ggccccttcc aaggaggtgt ttcctgcatc cgtattgcac taggcaaggg tgtttgtggt 240
gaggcagctc actttcagga aactgttatt gttggagatg tgacgaccta tctcaactat 300
atttcttgtg atagtctagc taaaagtgaa attgtggtgc cgatgatgaa gaatggtcag 360
ttacttggag ttctggatct ggattcttca gagattgagg attacgatgc tatggatcga 420
gattatttgg aacaatttgt cgctattttg cttgaaaaga cagcatggga ctttacgatg 480
tttgaggaaa aatcttaa 498



<210> 67 
<211> 630
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 67

atgacaatcg aactattgac tccctttacc aaggtagagt tggagccaga aatcaaggag 60 

aaaaaacgca aacaagttgg gattttaggg gggaatttta accctgttca caatgcccat 120

ctcattgttg cggatcaagt acggcaacag ttgggactgg atcaagttct gctcatgcct 180

gaataccaac ctcctcacgt tgataaaaag gaaaccatcc ctgaacacca tcgtctcaag 240 

atgcttgagt tggcaattga gggaattgac ggcctagtca ttgaaaccat tgagttggag 300 

cgcaagggta tttcctacac ctacgacacc atgaagattt tgacagagaa gaatccagat 360 

acggattatt actttatcat cggtgccgac atggttgact atctgcctaa gtggtaccga 420 

attgatgaac tggttgacat ggttcagttt gtgggggttc agcgtccacg ctacaaggta 480

gggacttcct atccagttat ctgggtggac gtaccgctca tggatatctc gtccagcatg 540 

gtgcgtgcct tccttgccca aggtcggaaa cccaactttc tcctacctca gccagtgcta 600

gactacatcg agaaggaggg gctctactga 630

<210> 68
<211> 768 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 68

atgaatattg caaaaatagt cagagaagcg cgtgagcaga gtcgcttgac aaccttggac 60 

tttgcgacag gcatttttga tgaatttatc caattacatg gtgaccgttc ttttcgtgat 120 

gatggtgcag ttgttggtgg tattggttgg cttggagacc aagctgtaac agtggttggt 180 

atccaaaaag gcaagagttt gcaagacaac ctcaaacgga attttggcca accacatcca 240 

gaaggctacc gaaaggcact gcggttgatg aaacaggctg agaaatttgg ccgtccagtt 300

gtgaccttta tcaatacagc aggtgcttat cctggtgtcg gagcggaaga acgtggtcaa 360 

ggggaagcta ttgctcgcaa tctcatggaa atgagtgacc tgaaagttcc tattatcgcc 420 

attattatcg gtgaaggtgg ttcaggcggg gctctggctc tagctgtcgc ggaccgtgtc 480 

tggatgctgg aaaattctat ctatgccatt ctcagtccag aaggctttgc ttccatttta 540

tggaaggacg gtactcgcgc catggaagca gcagaactga tgaaaatcac ttcgcatgaa 600

ctgttagaaa tggacgtggt ggataaggtg atttctgaag taggactttc tagtaaagaa 660 

ctgattaaga gtgtcaaaaa agaactccaa acggagctgg ctagactttc acaaaaaccg 720 

ctagaagagt tgctggaaga acgctatcaa cgatttagaa aatactaa 768 

<210> 69
<211> 510 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 69

atgattataa aagtagaaat ggcagatgtt gaggtgttgg ctaaaattgc caaacaaacc 60 
tttcgtgaaa cctttgcgta tgataatacg gaagagcagt tacaggaata ctttgaagag 120 
gcttatagtc tgaaaacttt gtcaactgag ttgggaaatc ctgactctga aacctatttc 180
attatgcatg aggaggagat agctggtttt ctcaaagtca actggggaag tgctcaaact 240 
gagagagaat tagaggacgc ttttgaaatt caacgcctct atgtgctaca aaaattccaa 300
ggatttggac taggtaagca actgtttgaa ttcgcacttg aacttgctac aaaaaatagt 360 
ttttcttggg cttggctagg tgtttgggag cataatacaa aagctcaagc cttttataat 420
cgatatggtt ttgaaaaatt tagccaacat cattttatgg ttggtcaaaa agtagatacg 480
gattggttac tgagaaagaa attaaggtaa 510 


<210> 70 
<211> 1590 
<212> DNA 

<213> Streptococcus pneumoniae 


<400> 70 

atgttacggg ggactgcttt gctaacggct agtaacttta tcagtcgcct actcggggct 60
gtttacatta tcccttggta catctggatg ggggcttatg cagctaaggc aaatggtctc 120
tttaccatgg gttacactat ctatgcttgg ttcttgttgg tttcaacagc ggggattcca 180 
gttgcggtgg ccaagcaagt tgccaagtat aataccatgc gagaagaaga gcatagcttt 240 
gccctgattc ggagcttctt aggctttatg acaggactag gcctggtttt tgctttagtc 300 
ttgtatgtct ttgctccttg gctagcagac ttgtctggcg tgggcaaaga cttgatccca 360
atcatgcaaa gcttggcttg gggagtcttg attttcccgt ctatgagtgt tatccgagga 420 
tttttccaag ggatgaataa cctcaaaccc tatgccatga gccaaattgc tgagcaggtc 480 
attcgtgtta tctggatgct cctagcaacc tttatcatta tgaagctcgg ttcaggagat 540 
tatctagcag ccgttaccca atcaaccttt gctgcctttg tcggtatggt agccagtttt 600 

gcagtcttga tttatttcct tgcccaagaa agttcactca aaagagtctt tgaaacagga 660
gataagatta acagtaagcg tctcttggtt gataccatta aggaagccat tccttttatc 720 
ctgacagggt ctgccatcca gatcttccag attttggatc agctgacctt tatcaatagt 780 
atgagctggt ttaccaacta cagcaatgag gacttggttg tcatgttttc ttatttctca 840
gccaatccta ataaaatcac gatgattttg atttctgtag gggtttcgat tgggagtgtt 900 

ggtttgccac ttttgacgga aaactatgtc aagggggact tgaaagcggc ttctcgtctc 960
gttcaggaca gtctcaccct actctttatg ttcttgctac cagcaacggt tggagtggtt 1020 
atggtaggag aacctcttta tacggtcttc tatggtaagc cagatagttt ggctctgggc 1080 
ttatttgtct ttgcagtttt gcagtctatt attttaggct tgtacatggt cttgtctcca 1140 
atgcttcagg ccatgttccg caaccgcaag gccgttctct attttatcta tggttctatt 1200 

gccaagctag tcttgcaact acctaccatc gccctcttcc acagttatgg tcctttgatt 1260
tcaacaacca ttgctctcat cattcctaac gtcttgatgt atcgggatat ttgtaaagta 1320 
actggtgtca agcgcaaggt gattttgaag cgaaccattt taatcagttt gctgacccta 1380 
gtcatgtttc tgttaatagg aaccatccag tggctgttag gatttttctt ccaaccaagt 1440 
ggacgtttgt ggagcttctt ttatgtagct cttgtcggtg ccatgggggg tggactttat 1500 

atggttatga gtctgcgtac ctatttatta gataaggtaa taggaaaagc ccaagcagat 1560
cgcctgcgag caaaatttaa gctttcgtaa 1590

<210> 71 

<211> 468 

<212> DNA

<213> Streptococcus pneumoniae 

<400> 71 

atgtcagata agattggctt attcacaggc tcatttgatc cgatgacaaa tgggcatctg 60 

gatatcattg aacgggcgag cagactcttt gataagctct atgtcggtat tttttttaat 120

ccccacaaac aaggatttct tcctatcgaa aatcgtaaac gggggctaga aaaggctttg 180 

ggacatctgg aaaatgttga agtcgtggct tctcatgatg aattggtggt cgatgttgca 240 

aaaagattgg gtgctacttg tctagtgcgt ggtttgagga atgcgtcgga tttgcaatat 300 

gaagccagtt ttgattacta caatcatcag ctgtcttctg atatagagac tatttattta 360 

catagtcgac ctgaacatct ctatatcagt tcatcaggcg ttagagagct tttgaagttt 420

ggtcaggata ttgcctgcta tgttcccgag agtatttgga ggaaataa 468 

<210> 72
<211> 432 
<212> DNA

<213> Streptococcus pneumoniae 

<400> 72 

atgacgattt tgtttgtggt tatcagtgct tcctttctgt atatggtttc tcttagcatg 60 
aaaccctatc aaacagctaa aagtgaagga gaaaaattag ctcagcagta tgcaggatta 120
gagcaggccg atcaggttga tttatacaat ggcttggaat cttattacag cgttcttggt 180 
cgtaataaac agcaagaagc acttgctgtt ctgattggaa aagatgatca taagatttac 240 
gtttatcagc taaatcaggg tgtttcacaa gaaaaagcag aaacggtttc taaggaaaag 300 
ggagctggcg aaattgacaa gattatcttt ggtcgttatc aagataagcc aatctgggaa 360 
gtcaagtcag gatctgattt ttatctagta gattttgaaa caggagcatt ggtcaacaag 420
gagggcctat ga 432 
<210> 73
<211> 732 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 73 

atgattgata ttcattcgca cattgtcttt gatgtagatg atggtcccaa gtcaagagag 60
gaaagtaagg ctctcttgac agaagcctac aggcaggggg tgcgaaccat tgtctctacc 120
tctcaccgtc gcaagggcat gtttgaaact ccagaagaga agatagcaga aaactttctt 180
caggttcggg aaatagctaa ggaagtcgcg agtgacttgg tcattgctta tggggctgaa 240
atttactaca cgccagatgt tttggataag ctggaaaaca atcggattcc gaccctcaat 300
aatagtcgtt atgccttgat agagtttagt atgaacactc cttatcgcga tattcatagt 360
gccttgaata aaatattgat gttgggaatt actcccgtca ttgcccacat agagcgctat 420
gatgttcttg aaaataatga aaaacgcgtt cgagagctga tcgatatggg ctgttacacg 480
caaataaata gttcacatgt cctcaaatcc aaactttttg gagaacctta taaattcatg 540
aaaaaaagag cgcagtattt cttggagcgt gatttggttc atatcattgc aagtgatatg 600
cataatgtgg acggcagacc cccccatatg gcagaagcat atgaccttgt ttcccaaaaa 660
tacggagaag cgaaggctca ggaacttttt atagacaatc ctcgaaaaat tgtaatggat 720
caactaattt ag 732

<210> 74
<211> 927
<212> DNA

<213> Streptococcus pneumoniae
<400> 74

atgtctacaa tcgataaaga aaaatttcag tttgtaaaac gtgacgattt tgcctctgaa 60
actattgatg cgccagcata ttcttactgg aaatcagtgt ttaaacaatt tatgaagaaa 120
aaatcaactg tagtcatgtt gggaatcttg gtagccatca ttttgataag tttcatctac 180
ccaatgtttt ctaagtttga tttcaatgat gtcagcaagg taaacgactt tagtgttcgt 240
tatatcaagc caaatgcgga gcattggttc ggtactgaca gtaacggtaa atcgctcttt 300
gacggtgtct ggttcggagc tcgtaactcc atcctcattt ctgtgattgc gacagtgatt 360
aacttggtta tcggtgtttt tgtcggtggt atttggggta tttcaaaatc agttgaccgt 420
gtcatgatgg aagtttacaa cgtcatctca aacatcccac ctcttttgat tgttattgtc 480
ttgacttact caatcggagc tggattctgg aatctgattt ttgccatgag cgtaacaaca 540
tggattggta ttgccttcat gatccgtgtg caaatcttgc gctatcgtga cttggaatac 600
aacttggcgt cacgtacttt gggaacacca accttgaaga ttgttgccaa aaatatcatg 660
cctcaattgg tatctgttat tgtgacaacc atgactcaaa tgcttccaag ctttatctca 720
tacgaagcct tcttgtcttt cttcggtctt ggattaccga ttacagtgcc aagtttgggt 780
cgtttgattt cggattattc acaaaacgta acaaccaatg cttacttgtt ctggattcca 840
ttgacaaccc ttgtcttggt atccttgtcc cttttcgtag ttggtcaaaa cttagcggat 900
gctagtgatc cacgtacaca tagatag 927



<210> 75 

<211> 234

<212> DNA 

<213> Streptococcus pneumoniae 



<400> 75 

atgtataacc tattattaac cattttatta gtattatctg ttgtgattgt gattgcaatt 60
ttcatgcaac caaccaaaaa ccaatccagc aatgtatttg atgccagttc aggtgatttg 120
tttgaacgca gtaaagctcg cggttttgaa gctgtaatgc agcgtttgac agggatttta 180
gtctttttct ggctagccat tgccttagca ttgacggtat tatcaagtag ataa 234

<210> 76

<211> 1110
<212> DNA

<213> Streptococcus pneumoniae
<400> 76

atgtttcgta gaaataaatt atttttttgg accacagaaa ttttactctt aaccatcatc 60 
ttttacctat ggagacagat gggatctttg attaaccctt ttgttagcgt gcttaataca 120
attatgattc catttttatt agggggcttt ctttattatt tgacaaaccc tattgttact 180 
ttcttaaata aagtctgtaa actcaatcgt ttgcttggta ttttaattac cttgtgtact 240
ttggtctggg gaatggtcat aggtgttgtc tatctcttac ctattttgat taatcagtta 300 
tctagtttga ttatatctag tcaaactatt tatagtcgag tacaagactt aatcatagac 360 

ttatctaatt atcctgcgct ccagaatttg gatgtagaag ctacaattca gcagttaaac 420
ttatcctatg ttgatattct tcaaaatatc ctaaatagcg tatcaaatag tgtggggagc 480 
gtcttgtcag ctcttatcag tactgttttg attttgatta tgactccagt ttttttggtt 540
tatttcttat tagatggaca taaattcttg cccatgcttg aaagaacgat tctaaagagg 600 
gatcgcttgc atattgcagg cttattaaag aatttaaatg cgacgattgc tcgctatatt 660 

agtggagttt cgattgacgc aatcattata ggttgtttgg cttatattgg ctatagtatt 720
attggtttaa aatatgcttt agtttttgcc attttttctg gtgtagccaa tttaattcct 780 
tatgtggggc caagtattgg tttgattcct atgatcatcg caaatatatt cactgtaccc 840 
catagactgc tgattgcagt gatttatatg cttgttgttc agcaggtaga tggcaatatc 900 
ttatatcctc gaattgtagg aagtgttatg aaggttcatc caatcacgat tttagtttta 960 

cttttgttgt caagcaatat ctatggtgta gttggaatga ttgtcgcagt gccaacctat 1020
tctatcttga aagaaatttc taagttctta tcccgtttgt atgaaaatca taaaataatg 1080 
aaagaacgag aaagagaatt agctaagtaa 1110 

<210> 77 
<211> 1356
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 77 

atgtatcaag cactttatcg aaaatataga agtcaaaact tctcccagtt agttggtcaa 60
gaagttgtgg ctaagactct taaacaagcg gtggagcaag agaaaataag tcacgcttat 120 
cttttttctg gtcctcgtgg aacgggaaaa accagtgttg ctaaaatctt tgccaaggct 180 
atgaactgtc ccaatcaagt gggtggcgaa ccttgcaata actgctatat ttgtcaagca 240 
gtgacggacg gtagtttaga agatgtcatt gaaatggatg cagcttctaa taatggggta 300 

gatgaaattc gcgaaattcg tgataaatct acctatgcgc ctagccttgc tcgttataag 360
gtttatatca tagatgaggt tcacatgctg tctacagggg cttttaatgc cctcctaaag 420 
acgctggaag aaccaacaca gaatgtagtc tttattttgg ccactactga attgcacaag 480 
attcctgcta ctattctatc ccgtgtgcaa cgttttgagt ttaaatcaat taagacacag 540 
gatattaagg aacatattca ctatatctta gaaaaagaaa atatcagttc tgaaccagag 600 

gctgtggaaa tcattgccag acgggcggaa ggtggaatgc gggacgcctt gtctattttg 660
gatcaagccc tgagtttgac acagggaaat gagctgacga ctgctatctc tgaagaaatt 720 
actggcacca ttagcctatc agccttggat gattatgtgg cggccttgtc tcaacaggat 780 
gttcccaaag ctttgtcttg cttgaatctt ctttttgaca atggtaagag catgactcgt 840
tttgtgaccg atcttttgca ctatttaaga gacttgttaa ttgttcaaac agggggagaa 900 

aatactcatc atagttcagt ctttgtagaa aatttggcac ttcctcaaaa aaatctgttt 960
gaaatgattc gcttagcaac agtgaattta gcagatatta agtctagttt gcagcccaag 1020 
atttatgctg aaatgatgac cgtccgtttg gcggaaatca agcccgaacc agctctatca 1080 
ggagcggttg aaaatgaaat tgctacgctg agacaggaag ttgcccgtct caaacaagag 1140
ctttctaatg caggtgcggt tcctaaacaa gttgcaccag ctcctagtcg accagctacg 1200 

ggcaaaacag tctatcgtgt cgatcgcaat aaagtgcaat ctatcttaca agaggccgtc 1260
gaaaatcctg atttaacacg tcaaaatcta attcgtttgc agaatgcatg gggagaggta 1320 
attgaaagtc taggtgggcc ggacaagctc tgctag 1356

<210> 78 
<211> 1989
<212> DNA

<213> Streptococcus pneumoniae 
<400> 78

atgtttcgat taaccaataa gttagcggta tcgaacttga ttaaaaaccg caaactctac 60
tatccttttg cgctggctgt tctcttggca gtcactctca cctatctctt ttactctcta 120
accttcaatc ctaagattgc ggaaatccgt ggaggaacaa ccattcaggc tacacttgga 180
tttggtatgt ttgtcgtcac ccttgcgtca gccattatcg ttctctatgc caatagtttt 240
gtcatgaaga aacgttccaa ggaactagga atttatggca tgttgggctt ggagaagcgt 300
catcttatca gtatgacctt taaggagtta gtggtatttg ggattctaac tgttggagcg 360
ggtatcggta ttggagcctt gtttgacaag ttaattttcg ctttcctgct caaactaatg 420
aaattgaagg ttgagctggt tgctaccttc cagacgaaag ttgtcattac agtgcttgtt 480
gtcttcggtt tgattttcct aggcctcatg ttcctgaatg cccttcgaat cgcccgtatg 540
aatgccctcc agctctctcg tgagaaagct agtggagaga aaaaaggtcg cttccttcct 600
ctccaaacca ttcttggttc cataagttta ggaattggct attatcttgc ccttacggta 660
aaagatcctc ttacagcctt aacaaccttc ttcatagctg ttttactggt tatctttggg 720
acttatctct tgtttaatgc agggattacc gttttcctcc aaatcttaaa gaaaaataag 780
aaatactatt accaaccaaa taacctcata tctgtttcta acttgatttt ccgtatgaag 840
aaaaatgcag ttggactagc aactatcgct attttgtcaa caatggtttt ggtaaccatg 900
tcagcagcga caagcatttt caattccgca gaatccttta aaaaagttct aaatcctcat 960
gattttgggg tttcaggaca aaatgttgaa aaagaagatt tggacaaact cttgagccag 1020
tttgcaagtg acaatggtta taagattaaa gaaaaagaag tgtttcgtta cacttacttt 1080
ggtgttgcga accaagaagg aaataagtta accttttttg aaaaaggaca aaatcgtgtc 1140
caacccacaa cagttttcat ggtatttgac caaaaagatt atgaaaatat gactggtcaa 1200
aaactgtctc tatcaggaaa tgaggtcggt ctctttgcca aaaatgacgg actgaaagga 1260
cagaaaactc taattctgaa tgatcatcaa ttttctgtaa aagaagaatt taataaagat 1320
tttattgtca accatgtccc aaatcagttt aatattttga ctgctgatta caattacctt 1380
gttgtacctg atttacaagc ctttttgaac caattcccag attcggatat ctataatcag 1440
ttttacggtg gtatgaatgt aaatgtcagt gaagaagaac aactcaaggt cgctgaggag 1500
tatgaaaact acctcaatca atttaatgct caattagaca cagaaggtag ctatgtttat 1560
ggtagcaatc tagcagatgc tagttctcag atgagtgccc tctttggtgg tgtcttcttt 1620
atcggtattt tcctatccat tatctttatg gtcggaactg ttctggtcat ctactacaaa 1680
caaatttctg aaggctacga agaccgtgaa cgctttatta tcttgcagaa agtcggtttg 1740
gaccaaaagc aaatcaagca aaccatccac aaacaggttt taactgtttt cttccttcct 1800
ttgctctttg ccttcataca tctcgccttt gcctaccata tgcttagcct gattttaaaa 1860
gtgattggtg tactggatac gactatgatg ttgattgtga ccttgtctat ctgcgctatc 1920
ttcctcatcg cctatgtgct gattttcatg attacttcaa gaagttatcg caagattgtg 1980
caaatgtaa 1989

<210> 79
<211> 891
<212> DNA

<213> Streptococcus pneumoniae

<400> 79

atgaaacaag atcaactaaa ggcttggcaa ccagctcagt ttgaccgttt tgtccgtatc 60
ttagaacaag accagctcaa tcacgcctat ctcttttcag gtttctttgg aagcttggaa 120
atggcgcaat ttttagctaa gagcctcttt tgtacggata aagttggcgt cttaccatgt 180
gagaaatgcc gaagttgcaa gctgattgaa caggaagagt ttccagatgt caccttgatt 240
aagccagtca atcaggtcat caagacagaa cgcattcggg aattggtggg acagttttct 300
caagcaggga ttgaaagcca gcaacaggtc tttattatcg agcaagcgga taaaatgcat 360
cccaacgcag ccaattctct gctcaaggtc atcgaagaac cccagagtga agtttatatt 420
ttcttcttga ctagcgatga ggaaaagatg ttaccgacaa tccgaagtcg gactcagatc 480
ttccacttta aaaagcaaga agaaaaactt atcttactct tagaacaaat gggacttgtt 540
aagaaaaaag cgactctttt agctaagttt agtcaatcgc gagctgaagc agaaaagttg 600
gctaatcagg caagtttttg gaccttggtc gatgaaagtg aacgcctgct gacttggtta 660
gtagctaaga aaaaagaaag ttatctacag gttgccaaat tagccaactt ggcagatgat 720
aaggaaaaac aggatcaggt tttacggatt cttgaagttc tctgtgggca ggacctcttg 780
caggtaagag taagagtgat tctacaagat ttactagaag ctagaaaaat gtggcaagct 840
aatgtcagct ttcaaaatgc catggaatat ctggtcttga aagaaatata a 891



<210> 80 
<211> 615 
<212> DNA

<213> Streptococcus pneumoniae

<400> 80

atgaattcat ttaaaaattt cttaaaagag tggggactgt tcctcctaat tctgtcatta 60
ctagctttaa gtcgtatctt tttttggagc aatgttcgcg tagaaggaca ttccatggat 120
ccgaccctag cggatggcga aattctcttc gttgtaaaac accttcctat tgaccgtttt 180
gatatcgtgg tggcccatga ggaagatggc aataaggaca tcgtcaagcg cgtgattgga 240
atgcctggcg acaccattcg ttacgaaaat gataaactct acatcaatga caaagaaacg 300
gacgagcctt atctagcaga ctatatcaaa cgcttcaagg atgacaaact ccaaagcact 360
tactcaggca agggctttga aggaaataaa ggaactttct ttagaagtat cgctcaaaaa 420
gcccaagcct tcacagttga tgtcaactac aacaccaact ttagctttac tgttccagaa 480
ggagaatacc ttctcctcgg agatgaccgc ttggtttcga gcgacagccg ccacgtaggt 540
accttcaaag caaaagatat cacaggggaa gctaaattcc gcttctggcc aatcacccgt 600
atcggaacat tttaa 615

<210> 81
<211> 987
<212> DNA

<213> Streptococcus pneumoniae
<400> 81

atggtagtat ttacaggttc aactgttgaa gaagcaatcc agaaaggatt gaaagaatta 60
gatattccaa gaatgaaggc tcatatcaaa gtcatttcta gggagaaaaa aggctttctt 120
ggtctatttg gtaaaaaacc agcccaagtg gatattgaag cgattagtga aacgactgtt 180
gtcaaagcaa atcaacaggt agtaaaaggc gttccgaaaa aaatcaatga tttgaacgag 240
cctgtgaaga cggttagtga agaaaccgtt gaccttggtc atgtggttaa tgctattaaa 300
aaaatagagg aagaaggtca aggtatttct gatgaagtca aggctgaaat cttaaaacat 360
gaaagacatg ccagcactat cttagaagaa actggtcaca ttgagatttt aaatgaactt 420
caaatcgagg aagcgatgag ggaagaagca ggcgctgatg accttgaaac tgagcaagat 480
caaactgaaa atcaagactt gaaagagatg ggcttgaagg tcgagcaaag ttatgatatt 540
gcccaggtgg ctacggatgt gactgcctat gttcaagcga ttgtggatga catggatgtt 600
gaagctacac tttcaaatga ttataaccgt cgtagcatca atctacaaat tgacaccaac 660
gaaccaggtc gtattatcgg ctaccatggt aaagtcttga aggccttgca actgttggct 720
caaaattatc tttacaaccg ctattccaaa accttctacg ttacaatcaa tgtcaatgat 780
tatgtcgaac accgtgcaga agtcttgcag acctatgcgc aaaaattggc gaatcgtgtt 840
ttggaagaag gtcgcagtca taaaacagat ccaatgtcaa atagcgaacg caagattatc 900
catcgtatta tttcacgtat ggatggcgtg actagttact ctgaaggtga tgagccaaat 960
cgctatgttg ttgtagatac agaataa 987

<210> 82
<211> 1383
<212> DNA

<213> Streptococcus pneumoniae

<400> 82

atgtcaaatt ttgccattat tttagcagcg ggtaaaggga ctcgcatgaa atctgatttg 60
ccaaaagttt tgcacaaggt tgcgggtatt tctatgttgg aacatgtttt ccgtagtgtg 120
ggagctatcc aacctgaaaa gacagtaaca gttgtaggac acaaggcaga attggttgag 180
gaggtcttgg ctggacagac agaatttgtg actcaatctg aacagttggg aactggtcat 240
gcagttatga tgacagaacc tatcttagaa ggtgtgtcag gacacacctt ggtcattgca 300
ggagatactc ctttaatcac tggtgaaagc ttgaaaaact tgattgattt ccatatcaat 360
cataaaaatg tggccactat cttgactgct gaaacggata atccttttgg ctatggacga 420
attgttcgta atgacaatgc tgaggttctt cggtcattgt tgagcagaag gatgctacag 480

attttgaaaa gcaaatcaag gaaatcaaca ctggtaacat acgtctttga caacgagcgt 540 

ttgtttgagg ctttgaaaaa tatcaatacc aataacgctc aaggcgaata ctatattaca 600 

gacgtcattg gtattttccg tgaaactggt gaaaaagttg gcgcttatac tttgaaagat 660 

tttgatgaaa gtcttggggt aaatgaccgt gtggcgcttg cgacagctga gtcagttatg 720

cgtcgtcgca tcaatcataa acacatggtc aacggtgtta gctttgtcaa tccagaagca 780 

acttatatcg atattgatgt tgagattgct ccggaagttc aaatcgaagc caatgttatc 840 

ttgaaagggc aaacgaaaat tggtgctgag actgttttga caaacggtac ttatgtagtg 900

gacagcacta tcggagcagg agcggtcatt accaattcta tgattgagga aagtagtgtt 960 

gcagacggtg tgacagtcgg tccttatgct cacattcgtc caaattcaag tctgggtgcc 1020

caagttcata ttggtaactt tgttgaggtg aaaggatctt caatcggtga gaataccaag 1080 

gctggtcatt tgacttatat cggaaactgt gaagtgggaa gcaacgttaa tttcggtgct 1140 

ggaactatta cagtcaacta tgacggcaaa aacaaataca agacagtcat tggagtcaat 1200 

gtctttgttg gttcaaattc aaccattatt gcaccagtag aacttggtga caattccctc 1260 

gttggtgctg gttcaactat tactaaagac gtgccagcag atgctattgc tattggtcgc 1320

ggtcgtcaga tcaataaaga cgaatatgca acacgtcttc ctcatcatcc taagaaccag 1380 

tag 1383 

<210> 83 
<211> 936

<212> DNA

<213> Streptococcus pneumoniae 

<400> 83 

atgtccaaga ttctagtatt tggtcaccaa aatccagact cagatgccat cggatcatct 60
gtagcttttg cctaccttgc aaaagaagct tacggtttgg atacggaagc tgttgccctt 120 
ggaactccaa atgaagaaac agcctttgtc ttgaactatt ttggtgtgga agcaccaqgt 180 
gttatcactt ctgccaaagc agagggggca gagcaagtta tcttgactga ccacaatgaa 240 
ttccaacaat ctgtatcaga tatcgctgaa gtagaagttt acggtgttgt agaccaccac 300 

cgtgtggcta actttgaaac tgcaagccca ctttacatgc gtttggagcc agttggatca 360
gcgtcttcaa tcgtttaccg tatgttcaaa gaacatggtg tagctgttcc taaagagatt 420 
gcaggtttga tgctttcagg tttgatttca gatacccttc ttttgaaatc accaacaaca 480 
cacccaacag ataaaatcat tgctcctgaa ttggctgaat tggctggtgt aaacttggaa 540 
gaatatggtt tggcaatgtt gaaagctggt accaacttgg ctagcaaatc tgctgaagaa 600 

ttgattgaca tcgatgctaa gacttttgaa ctcaacggaa ataatgtccg tgttgcccaa 660
gtgaacacag ttgacatcgc tgaagttttg gaacgccaag cagaaattga agctgcaatg 720 
caagctgcca acgaatcaaa cggctactct gactttgtct tgatgattac agatatcgtc 780
aactcaaact cagaaatatt ggctcttggt gccaatatgg acaaggtcga agcggctttc 840 
aatttcaaac ttgaaaacaa tcatgccttc cttgctggtg ccgtttcacg taagaaacaa 900 

gtggtacctc aattaactga aagctttaat acgtaa 936

<210> 84 

<211> 678 

<212> DNA 

<213> Streptococcus pneumoniae 

<400> 84 

atgatttcaa agagattaga attggtagct tcctttgtgt cacagggggc tattttacta 60 
gatgtgggaa gtgaccatgc ttatctgcct atcgagttgg ttgagagagg ccaaatcaaa 120 

agcgctattg caggtgaggt ggtggaaggt ccctatcagt ctgcggttaa aaatgttgag 180 
gctcacggcc taaaggagaa aatccaagtc cgtttagcca atggcttggc agcttttgaa 240 
gagactgacc aagtgtctgt cattaccatt gctggcatgg gtggtcgttt gattgctagg 300 
attttagaag aaggtttggg gaagttagct aatgtagagc gtttgatcct ccagcccaat 360 
aatcgtgaag acgacttgcg tatctggcta caggatcatg gattccagat tgtagcagaa 420 

agcatcttag aagaagctgg aaagttttat gagattttgg tggtggaagc aggacaaatg 480 
aagctatcag ccagtgatgt tcgctttggt cccttcttgt ccaaagaagt cagtccagta 540 
tttgtccaaa aatggcaaaa agaagctgag aagctagagt tcgccctcgg acaaatccca 600 



WO 01/49721 

PCT/US00/35604 



gaaaaaaatc tggaagaacg tcaagttcta gtagataaga ttcaagctat paaggaggtg 660 
ct ccatgfct a* ^gca^gj^ga 678 

<210> 85 
<211> 486 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 85 

atgaatttaa acgatattaa agacttgatg actcaatttg accagtcaag tttgagagaa 60 
ttttcttata aaaatgggac ggatgagttg cagtttagca agaatgaagc gagacctgtg 120 
cctgaagttg caactcaagt cgctccagca cccgttctag caacaccgag tccaojfagct 180 
cetac^§ctg            gactgtagca gaagaagttc cagctccagc tgaagcaagt 240 
gtggctagtg agggaaatct tgtagagagt ccacttgttg gagtggttta cttggctgct 300 
ggaccagata aacctgcctt cgttacagtt ggtgatagtg tcaaaaaagg tcaaacattg 360 
gtaattatcg aagccatgaa agtcatgaat gaaatcccag ctcctaagga tggtgtggta 420 
acggaaattc tcgtctctaa cgaagaaatg gttgagtttg gtaaaggatt ggtacgtatc 480 
aaatga 486 



<210> 86 
<211> 1236 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 86 

atgaaactaa atcgagtagt ggtaacaggt tatggagtaa catctccaat cggaaataca 60 
ccagaagaat tttggaatag tttagcaact gggaaaatcg gcattggtgg cattacaaaa 120 
tttgatcata gtgactttga tgtgcataat gcggcagaaa tccaagattt tccgttcgat 180 
aaatactttg taaaaaaaga taccaaccgt tttgataact attctttata tgccttgtat 240 
gcagcccaag aggctgtaaa tcatgccaat cttgatgtag aggctcttaa tagggatcgt 300 
tttggtgtta tcgttgcatc tggtattggt ggaatcaagg aaattgaaga tcaggtactt 360 
cgccttcatg aaaaaggacc caaacgtgtc aaaccaatga ctcttccaaa agctttacca 420 
aatatggctt ctgggaatgt agccatgcgt tttggtgcaa acggtgtttg taaatctatc 480 
aatactgcct gctcttcatc aaatgatgcg attggggatg ccttccgctc cattaagttt 540 
ggtttccaag atgtgatgtt ggtgggagga acagaagctt ctatcacacc ttttgccatc 600 
gctggtttcc aagccttaac agctctctct actacagagg atccaactcg tgcttcgatc 660 
ccatttgata aggatcgcaa tgggtttgtt atgggtgaag gttcagggat gttggttcta 720 
gaaagtcttg aacacgctga aaaacgtgga gctactatcc tggctgaagt ggttggttac 780 
ggaaatactt gtgatgccta ccacatgact tctccacatc cagaaggtca gggagctatc 840 
aaggccatca aactagcctt ggaagaagct gagatttctc cagagcaagt agcctatgtc 900 
aatgctcacg gaacgtcaac tcctgccaat gaaaaaggag aaagtggtgc tatcgtagct 960 
gttcttggta aggaagtacc tgtatcatca accaagtctt ttacaggaca tttgctgggg 1020 
gctgcgggtg cagtagaagc tatcgtcacc atcgaagcta tgcgtcataa ctttgtacca 1080 
atgacagctg ggacaagtga agtatcagat tatatcgaag ctaatgtcgt ttatggacaa 1140 
ggcttggaga aagaaattcc atacgctatt tcaaatactt ttggttttgg aggccacaat 1200 
gcagttcttg ctttcaaacg ttgggagaat agataa 1236 



<210> 87 
<211> 1080 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 87 

atgaacatct atgatcaact acaagttgta gaagaccgtt atgaagaatt aggagaattg 60 
ctgagtgacc ctgatgtcgt ttcagacacc aagcgtttta tggagctttc aaaagaagaa 120 
gcttccaatc gtgacaccgt aatagcctac cgtgagtata aacaagtcct tcaaaatatc 180 
gtcgatgccg aagagatgat taaggaatca ggcggagatg cggacttgga agaattggcc 240 
aagcaagaac tcaaagatgc caaggctgaa aaagaagaat atgaagaaaa actgaaaatt 300 
ttgctccttc caaaggatcc aaacgatgac aagaatatca tccttgaaat ccgtggagca 360 
gctggtggag acgaagcggc acttttcgct ggagatttgc taactatgta ccaaaagtat 420 
gcggaagccc aaggttggcg ctttgaagtc atggaagcct ctatgaatgg tgtcggtggt 480 
tttaaagaag tggttgctat ggtttcaggt cagtctgtat actctaagct taagtatgaa 540 
tcaggtgccc accgtgtgca acgtgttcct gtgacagaaa gccaaggccg tgttcatact 600 
tcgacagcga cagttcttgt tatgccagaa gttgaagagg ttgaatacga cattgatcca 660 
aaagaccttc gtgtcgacat ctatcacgcc tctggtgctg gtggacagaa cgtcaataag 720 
gttgcgactg ccgttcgtat cgttcacttg ccaaccaata tcaaggttga gatgcaggaa 780 
gaacgtaccc agcagaagaa ccgcgagaag gctatgaaga ttatccgtgc acgcgtcgct 840 
gaccactttg ctcagattgc tcaggatgaa caagacgctg agcgtaagtc gacaatcggt 900 
actggtgacc gttcagaacg gatccgaact tataacttcc cacaaaaccg tgtcacagac 960 
caccgtatcg gcttgaccct ccaaaaacta gatacgattt tgtctggtaa attggacgaa 1020 
gttgtggatg ccttggtgct ttatgaccaa acacaaaaac tagaagaatt aaacaaataa 1080 

<210> 88 
<211> 1680 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 88 

atggcctaca ctcttaaacc tgaagaagtt ggtgtttttg ccatcggtgg tctaggagaa 60 
atcgggaaaa acacttacgg aattgaatac caagacgaga ttatcatcgt cgatgctggg 120 
attaaattcc cagaagatga cttgcttggt atcgactatg tcattcctga ctactcttac 180 
atcgtagaca atatcgaccg cgtcaaggct gttttaatca cacacggaca cgaggaccac 240 
attggtggga ttccgttcct actcaagcaa gcaaatgtcc ctatttatgc tggaccgctt 300 
gccttggctt tgatccgtgg gaaactcgaa gaacacggcc tcttgcgcaa cgccaaactt 360 
tacgaaatca accacaacac cgagttgacc tttaaaaatc tcaaggcaac tttctttaga 420 
acgactcact ctattccaga gcctttgggg attgtcattc atactcctca agggaaaatc 480 
gtctgtacgg gtgactttaa gttcgacttt actccagttg gagaacctgc ggacttgcat 540 
cgtatggctg cgcttggtga agaaggcgtg ctctgtctcc tgtctgactc gacaaatgcg 600 
gaagtaccaa cctttaccaa ctctgaaaaa gtcgttggtc agtccattat gaagattatc 660 
caaggtattg aaggacgtat catctttgca tcctttgcct caaatatctt ccgtctccag 720 
caggcaacag aagctgctgt taagactgga cgcaagattg cggtctttgg tcgttctatg 780 
gaaaaggcca ttgtcaacgg aatcgatctt ggctacatca aagctcctaa gggaaccttt 840 
atcgagccaa atgaaatcaa agattatcct gcaggagaag ttcttatcct ctgtacaggt 900 
agtcagggtg agcctatggc agccctctct cgtatcgcca acggaaccca ccgtcaagta 960 
caattacaac caggtgatac cgttatcttc tcttctagtc ccatccctgg aaacactact 1020 
agcgtcaaca agctgattaa catcatttct gaagctggtg tcgaagttat ccacggtaaa 1080 
gtgaacaata tccatacatc tggacacggt ggtcagcaag agcaaaaact catgctctgc 1140 
ttgattaagc caaaatactt catgcctgtc cacggtgaat accgcatgca aaaagtccac 1200 
gctggactag cagtggatac tggtgttgag aaggacaata tctttatcat gagcaatggc 1260 
gatgtgcttg cccttactgc tgactcagct cgtatcgcag gtcatttcaa cgcccaagat 1320 
atctatgtcg atggaaatcg tatcggtgaa attggcgcag ctgtcctcaa agatcgtcgc 1380 
gatctatctg aagacggtgt cgttctggca gtcgcaactg ttgacttcaa atcgcagatg 1440 
attctgtctg gcccagatat cctcagccga ggctttgtct acatgagaga gtctggagac 1500 
ttgattcgcc aaagccagcg tatcctcttc aatgccattc gtatcgcact gaaaaataag 1560 
gatgctagcg tgcaatctgt caatggtgcc attgtcaacg ctattcgccc cttcctctat 1620 
gaaaataccg aacgtgaace gatcatcatc ccgatgatcc tcacaccaga tgaagaataa 1680 

<210> 89 
<211> 1362 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 89 

atggcagaag tagaagagtt acgagtacaa cctcaagata tcttagctga gcaatccgtt 60 
ttaggggcta tctttattga tgagagtaaa cttgtttttg tgcgagaata cattgagtcfe 120 
cgggactttt ttaagtatgc ccatcgtttg attttccaag ccatggtcga tttatccgat 180 
cgtggtgatg ccatagatgc aacaacggtt cgtactatcc ttgataatca aggtgattta 240 
cagaatattg gtggcttgtc ttacttggtt gagattgtta attctgtgcc aacttctgct 300 
aatgcggagt attatgctaa gattgttgca gaaaaagcaa tgctacgtcg tttaattgcc 360 
aagttgacag agtctgtcaa ccaagcttac gaagcgtcac aaccagctga tgaaattatt 420 
gctcaggcag aaaaagggtt gattgatgtc agtgaaaatg caaatcgaag cgggtttaag 480 
aacattcgag atgtgttgaa tctcaacttt ggaaatctgg aagctcgctc gcaacaaacg 540 
accgatatta caggtattgc gacaggttat cgtgatttgg atcatatgac aacaggactt 600 
catgaggagg agttgattat cttagcagct cgtccagcag ttggtaagac agcatttgcc 660 
ttgaatatcg ctcagaacat tgggactaag ttggacaaaa cggttgctat tttttcactc 720 
gaaatgggtg cggaaagctt ggtagatcgt atgttagctg cagaaggctt ggtggagtca 780 
cattctatcc gtacagggca attgacagat gaggagtggc aaaaatatac tattgctcag 840 
ggtaatctag ctaacgccag tatctatatc gatgatacgc caggtattcg gattacagag 900 
attcgttctc gttctcgtaa attggctcaa gaaactggaa atcttggttt gattgtgata 960 
gactatttgc aacttatcac gggaactggt cgagaaaatc gtcaacaaga agtttctgaa 1020 
atttctcgtc agttgaaaat actagccaag gaattgaagg ttccagtaat cgctctgagt 1080 
cagctttctc gtggtgtaga acaacgtcag gacaacjagac cggtcttgtc tgatattcgt 1140 
gaatctgggt ctattgagca ggacgctgat atcgtagctt ttctctatcg cgatgactac 1200 
tatgaacgtg gtggtgaaga agaggagggt atcccaaata ataaggtgga agttattatc 1260 
gagaaaaacc gtagtggagc tcgtggaaca gtggaattga ttgtccaaaa agaatacaat 1320 
aaattcttca agtattcaag tatctcaaag agggaggcat aa 1362 



<210> 90 
<211> 693 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 90 

atggcgtata aatatttagt gattgtagaa tcacctgcaa aagccaagac aattgagaaa 60 

tatcttggac gaaactataa agtaatggcc agtgttggcc atataagaga tttaccgaaa 120 

agtaaaatgg gtatcgattt tgaaaacaat tatgaacccc attatatttc tatacgcgga 180 

aaaggcgatg tcatcaaaag cctgaaagcc gcagcaaaaa aagctcaaaa agtttacttg 240 

gcaagtgacc cggatagaga aggagaagcg attgcttggc atttagcgta cctacttggg 300 

ttggatctga aagaaaaaaa tcgggtggtc ttcaatgaaa tcacaaaaga cgcagtcaaa 360 

gcagctttta aggaaccaag aacgatcgat gtagatttag tagatgcaca gcaagctcgt 420 

cgtaccttag acagaatcgt tggttattcg atcagtccta ttctctggcg taaggtcaag 480 

aaagggttaa gtgcaggacg tgtccaatct gtcgctttaa aaattattat tgaccgtgaa 540 

aaagagatcc gagaatttgt tccagaagaa tattggagca tcgacggtaa ttttaaaaaa 600 

gctcgcaaga aattcaaagc aaatttctgg ggaatcgacg gtaagaaaaa gaaattacca 660 

gatgcacaaa gtgtaaaaag aagtcactgc tag 693 



<210> 91 
<211> 981 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 91 

atgtttattt ccatcagtgc tggaattgtg acatttttac taactttagt aggaattccg 60 

gcctttatcc aattttatag aaaggcgcaa attacaggcc agcagatgca tgaggatgtc 120 

aaacagcatc aggcaaaagc tgggactcct acaatgggag gtttggtttt cttgattact 180 

tctgttttgg ttgctttctt tttcgcccta tttagtagcc aattcagtaa taatgtggga 240 

atgattttgt tcatcttggt cttgtatggc ttggtcggat ttttagatga ctttctcaag 300 

gtctttcgta aaatcaatga ggggcttaat cctaagcaaa aattagctct tcagcttcta 360 

ggtggagtta tcttctatct tttctatgag cgcggtggcg atatgctttc tgtctttggt 420 

tatcaagtgc atctagggat tttctatatt gttttcgctc ttttctggct agtcggtttt 480 

tcaaacgcag taaacttgac agacggtgtt gacggtttag ctagtatttc cgttgtgatt 540 
agtttgtctg cctatggagt tattgcctat gtgcaaggtc agatggatat tcttctagtg 600 

attctggcca tgattggtgg tttgctcagt ttcttcatct ttaaccataa gcctgctaag 660 

atctttatgg gtgatgtggg aagtttggct ttaggtggaa tgctggcagc tatctctatg 720 

gctctccacc aagaatggac tctcttgatt atcggaattg tgtatgtttt tgaaacaact 780 

tctgttatga tgcaagtcag ttatttcaaa ctgacaggtg gtaaacgtat tttccgtatg 840 

acgcctgtac atcaccattt tgagcttggg ggattgtctg gtaaaggaaa tccttggagc 900 

gagtggaagg ttgacttctt cttttgggga gttgggcttc tagcaagtct cctgacccta 960 
gcaattttat atttgatgta a 981 

<210> 92 

<211> 2082 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 92 

atggcacgcg aattttcact tgaaaaaact cgtaatatcg gtatcatggc tcacgtcgat 60 
gccggtaaaa caacaactac tgagcgtatt ctttactaca ctggtaaaat ccacaaaatc 120 
ggtgaaactc acgaaggtgc gtcacaaatg gactggatgg agcaagagca agaacgtggt 180 
atcacgatca catctgctgc gacgacagct caatggaaca accaccgcgt aaacatcatc 240 

gacacaccag gacacgtgga cttcacaatc gaagtacaac gttctcttcg tgtattggat 300 
ggtgcggtta ccgttcttga ctcacaatca ggtgttgagc ctcaaactga aacagtttgg 360 
cgtcaagcaa ctgagtacgg agttccacgt atcgtatttg ccaacaaaat ggacaaaatc 420 
ggtgctgact tcctttactc tgtaagcaca cttcacgatc gtcttcaagc aaatgcacac 480 
ccaatccaat tgccaatcgg ttctgaagat gacttccgtg gtatcattga cttgatcaag 540 

atgaaagctg aaatctatac taacgacctt ggtacggata tccttgaaga agacatccca 600 
gctgaatacc ttgaccaagc tcaagaatac cgtgaaaaat tgattgaagc agttgctgaa 660 
actgacgaag aattgatgat gaaatacctc gaaggtgaag aaatcactaa cgaagaattg 720 
aaagctggta tccgtaaagc gactatcaac gttgaattct tcccagtatt gtgtggttca 780 
gccttcaaaa acaaaggtgt tcaattgatg cttgatgcgg ttatcgacta ccttccaagc 840 

ccacttgaca tcccagcaat caaaggtatt aacccagata cagacgctga agaaattcgt 900 
ccagcatctg acgaagagcc atttgcagct cttgccttca agatcatgac tgacccattc 960 
gtaggtcgtt tgacattctt ccgtgtttac tcaggtgttc ttcaatcagg ttcatacgta 1020 
ttgaatactt ctaaaggtaa acgtgaacgt atcggacgta tccttcaaat gcacgctaac 1080 
agccgtcaag aaatcgacac tgtttactca ggtgatatcg ctgctgccgt tggtttgaaa 1140 

gatactacaa ctggtgactc attgacagat gaaaaagcta aaatcatcct tgagtcaatc 1200 
aacgttccag aaccagttat ccaattgatg gttgagccaa aatctaaagc tgaccaagac 1260 
aagatgggta tcgcccttca aaaattggct gaagaagatc caacattccg cgttgaaaca 1320 
aacgttgaaa ctggtgaaac agttatctca ggtatgggtg aacttcacct tgacgtcctt 1380 
gttgatcgta tgcgtcgtga gttcaaagtt gaagcgaacg taggtgctcc tcaagtatct 1440 

taccgtgaaa cattccgcgc ttctactcaa gcacgtggat tcttcaaacg tcagtctggt 1500 
ggtaaaggtc aattcggtga tgtatggatt gaatttactc caaacgaaga aggtaaagga 1560 
ttcgaattcg aaaacgcaat cgtcggtggt gtggttcctc gtgaatttat cccagcggtt 1620 
gaaaaaggtt tggtagaatc tatggctaac ggtgttcttg caggttaccc aatggttgac 1680 
gttaaagcta agctttatga tggttcatat cacgatgtcg actcatctga aactgccttc 1740 

aagattgcgg cttcactttc ccttaaagaa gctgctaaat cagcacaacc agctatcctt 1800 
gaaccaatga tgcttgtaac aatcactgtt ccagaagaaa accttggtga tgttatgggt 1860 
cacgtaactg ctcgtcgtgg acgtgtagat ggtatggaag cacacggtaa cagccaaatc 1920 
gttcgtgctt acgttccact tgctgaaatg ttcggttacg caacagttct tcgttctgca 1980 
tctcaaggac gtggtacatt catgatggta tttgaccact acgaagatgt acctaagtca 2040 

gtacaagaag aaattattaa gaaaaataaa ggtgaagact aa 2082 

<210> 93 
<211> 1227 
<212> DNA 
<213> Streptococcus pneumoniae 

<400> 93 

atgccaaatt acaa^attcc attttcaccg cctgatatca cagaagcaga aattgctgaa 60 
gtagcggata ccctgcgttc tggttggatc acaacaggtc ctaaaacaaa agaactggag 120 
cgccgcttgt ctctttacac acagacacct aagactgttt gtctcaactc tgcgacagcc 180 
gctctggagt tgattttacg cgttttggaa gtgggacctg gtgatgaagt catcgttcca 240 
gccatgacct atacggcttc atgtagtgtc attacgcacg tgggagcaac ccctgtcatg 300 
gtggatatcc aagcagatac gtttgagatg gactatgacc tgcttgagca agctatcact 360 
gagaaaacta aggtgattat cccagtagag ctcgcaggga ttgtttgcga ttatgaccgt 420 
ttgttccaag tcgtggagaa aaaacgtgac ttctttaccg cttcaagcaa gtggcaaaag 480 
gcctttaacc gtattgtcat tgtctctgat agtgcccacg ctttgggatc tacttataaa 540 
ggacaacctt ctggttctat cgctgatttt acttccttct cattccatgc cgttaagaac 600 
tttacaacgg cagaaggtgg aagtgcgact tggaaagcca atccagtgat tgatgacgaa 660 
gagatgtaca aggaattcca aatcctttcc cttcacgggc aaactaagga tgctcttgcc 720 
aagatgcaac tggggtcatg ggaatacgat atcgttacac cagcctataa gtgcaacatg 780 
accgatatca tggcttcact tggtttggta caattggacc gctatccaag tttgttgcaa 840 
cgccgtaagg acattgtgga ccgctatgat agtggttttg caggttctcg catccatcct 900 
ttggcacaca agactgaaac tgtcgaatct tcacgccacc tctacatcac ccgtgtagaa 960 
ggagcaagcc tagaagaacg cagcctcatc atccaagaat tggctaaagc aggaattgca 1020 
agtaatgttc actacaaacc gcttcctctc ttgacagcct ataagaatct tggatttgat 1080 
atgacgaact atcctaaggc ctatgccttc tttgagaatg aaattaccct ccctcttcat 1140 
actaaattaa gcgatgaaga agtagactat atcattgaga ctttcaaaac agtttctgaa 1200 
aaagtgctaa ctttatcaaa aaaatga 1227 

<210> 94 
<211> 978 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 94 

atgacagaac ctgatttttg gaacgataat attgcggccc aaaaaacgtc gcaagaattg 60 
aatgtattta aaaacactta caataccttc cataagatgg aagagttgca ggatgaagtc 120 
gaaattttat tggatttttt ggctgaagac gagtcagtgc atgatgaact ggtagcgcag 180 
ttagccgaac ttgataagat aatgaccagc tacgagatga ctttactctt gtcagaacct 240 
tatgaccaca acaatgccat cttggaaatc catccaggtt ctggtggtac tgaggcgcag 300 
gactggggtg atatgttgct tcgtatgtat actcgttatg gtaatgctaa aggctttaaa 360 
gtggaagtgt tggattacca agcaggtgat gaggctggta ttaagtcggt aactttatca 420 
tttgaagggc ctaatgccta tggtctcctc aagtcagaaa tgggtgttca ccgcttagtg 480 
cgaatctcac catttgactc tgccaaacgt cgccatacct ctttcacatc tgtagaagtg 540 
atgccagaat tggatgatac tattgaagtg gaaatccgtg aagatgatat caagatggat 600 
accttccgtt caggtggtgc cggtggacaa aacgtcaata aagtttcaac aggtgtacgt 660 
ttaacccaca ttccaactgg aattgttgtc caatcaacag tggatcgtac ccagtatgga 720 
aatagagatc gtgccatgaa gatgttgcag gctaagctct atcaaatgga gcaagagaag 780 
aaggctgcgg aggtagattc tctcaaaggt gagaaaaagg agattacttg gggaagccaa 840 
atccgttctt atgtcttcac gccttatact atggtaaaag atcaccgaac tagctttgag 900 
gttgctcagg tagataaggt tatggatggg gacctagatg gttttatcga tgcttatctc 960 
aagtggcgaa ttagctaa 978 



<210> 95 
<211> 750 
<212> DNA 
<213> Streptococcus pneumoniae 

<400> 95 

atgttttata cttatttgcg tggattagtt gtattgctct tatggtccat caatggcaat 60 
gctcactatc ataatactga taaaattcct aatcaagatg aaaattatat tttagttgcg 120 
cctcaccgta cctggtggga tcctgtttat atggcctttg cgaccaagcc aaaacagttc 180 
atctttatgg caaaaaaaga actctttacc aaccgtatct ttggttggtg gattcgtatg 240 
tgtggcgcct ttcccatcga ccgtgaaaat cccagcgcct cagccatcaa atatcctatc 300 






aacgttctca aaaaaagtga ccgctctctc atcatgtttc caagtggtag ccgccactca 360 

aacgatgtca aggggggcgc agcactgatt gccaaaatgg ccaaggtccg tatcatgccg 420 

gttacctaca ccggtcccat gactttgaag ggcttgatta gccgtgaacg tgtcgatatg 480 

aactttggaa atccaatcga tatctcagat atcaagaaaa tgaatgatga aggcattgaa 540 

acagtcgcca atcgtattca aacagaattc caacgtctgg acgaagaaac gaaacaatgg 600 

cacaatgata aaaaaccaaa tccactctgg tggtttatcc gcatccctgc cctcatcctt 660 

gctattatcc tcgctatcct aaccatcatc tttagcttta tcgcaagctt catctggaac 720 

ccagataaga aaagagaaga acttgcatag 750 

<210> 96 

<211> 3102 
<212> DNA 
<213> Streptococcus pneumoniae 

<400> 96 

ttgatcgcac aactagatac aaaaacagtc tatagtttta tggaaagcgt catttcgatc 60 
gaaaagtatg tgagagcagc taaagaatac ggctacactc atttggctat gatggatatt 120 
gacaatcttt atggcgcttt cgactttcta gagattacaa aaaaatacgg cattcatcct 180 
ttgctagggc ttgaaatgac agtgtttgta gatgatcagg gagtaaattt gcgcttttta 240 
gctctatcta gtgtgggcta tcagcagttg atgaagcttt cgacagccaa gatgcagggg 300 
gagaaaactt ggtcagtcct gtcccagtac ctggaggata tcgcggtcat tgtgccttat 360 
tttgatagag ttgagtcgtt agaactaggc tgtgattact atataggggt ttatccagaa 420 
acactagcaa gcgaatttca tcatcctatc ttacctcttt atcgggtcaa cgcttttgaa 480 
agcagggata gagaagttct tcaagtttta acagcgatta aagaaaatct accgctcaga 540 
gaagttccct tgcgttcgag acaagatgtc tttatatcag caagttcttt agagaaacta 600 
ttccaagagc gttttccgca agctttggac aatttagaaa agcttatttc aggcatttct 660 
tacgacttgg atactagtct gaaactgcct cgttttaatc cagctagacc agcagtagag 720 
gagttgagag agcgtgctga actggggctt gttcagaagg ggttgactag taaagaatat 780 
caagatagac tagaccaaga attgtctgtt attcatgata tgggctttga tgattatttc 840 
ttggttgttt gggatttgtt gcgttttgga cgatcgaatg gctattatat gggaatggga 900 
aggggttctg cagtaggcag tttggtttct tatgccttag acatcacggg gattgaccca 960 
gtagagaaaa atctgatttt tgaacgcttt cttaatcgtg aacgctatac catgcctgat 1020 
attgatattg atatcccaga tatttatcgt ccagatttta tcagatatgt tggtaataaa 1080 
tatggtagta aacatgcggc acaaatcgtt actttttcaa cctttggagc caagcaagct 1140 
cttcgagatg tcttgaaacg ctttggtgtg ccagagtatg aattatctgc aattactaag 1200 
aaaatcagtt ttcgtgacaa tcttaagtcg gcctatgagg gcaatctcca gtttcgtcag 1260 
caaatcaata gtaagttaga ataccaaaaa gcttttgaga ttgcttgcaa gatagagggc 1320 
tatccaaggc aaacctctgt ccatgcggct ggtgttgtaa ttagtgacca agatttaacc 1380 
aactacattc ctctaaagta tggtgatgaa attccactga ctcagtatga tgctcatgga 1440 
gttgaggcta gcggactttt gaagatggac tttctgggac tacgaaattt gacctttgtc 1500 
cagaagatgc aagagttgct tgctgaaata gaaggtattc accttaaaat tgaagaaata 1560 
gatttggaag acaaagaaac gttagattta tttgcctctg gaaatacaaa aggtatcttt 1620 
caatttgagc aacctggtgc tattcgcttg ctcaaacgtg ttcaaccagt ctgttttgaa 1680 
gatgtcgtag caactacttc tctaaatcga ccgggtgcta gtgactatat caataatttt 1740 
gtggcaagaa agcatgggca ggaagaagtg actgttctgg atccagtact ggaggatatt 1800 
ttggctccaa cctacggcat aatgctctat caggagcagg ttatgcaggt tgcccagcga 1860 
tttgccggat ttagtcttgg gaaagccgat attttgcgtc gggctatggg gaaaaaggat 1920 
gcctctgcca tgcatgagat gagggcttcc tttattcaag gttcattaga agctggtcat 1980 
actgtggaaa aagcagagpa ggtctttgat gttatggaga agtttgcagg ctatggtttt 2040 
aacaggtcac acgcctatgc ctactcagca ttggccttcc agttggctta ttttaaaaca 2100 
cattatccag ccatatttta tcagatcatg ttgaattctg ccaacagtga ttacttaata 2160 
gatgcacttg aagcaggttt tgaagtggcg cctctgtcca tcaacacgat tccetatcac 2220 
gataaaattg ccaacaaggc catctatcta ggtttgaaat ccattaaagg agtcagtaat 2280 
gatttagctc tctggattat tgaacataga ccttattcta atattgaaga ttttatagct 2340 
aaattacctg agaattatct gaaacttcct ctgctagaac ctttggtaaa agttggtctt 2400 
ttcgattcat ttgaaaaaaa tcgtcaaaaa gtatttaata acttagctaa tctatttgaa 2460 
tttgtgaaag agttgggaag tttgtttgga gatgctattt atagttggca ggaatcggaa 2520 
gattggacgg aacaagaaaa attttatatg gaacaagagc ttttagggat aggtgtcagc 2580 
aaacatccac tacaagctat tgcaagtaag gctatttacc cgattacccc aatcggaaat 2640 
ttgtcagaaa atagctatgc tattattttg gttgaagttc agaaaataaa agtgattcgt 2700 
accaaaaagg gtgaaaatat ggccttctta caggcagatg atagtaagaa aaaattggat 2760 
gtcactctct tttcagactt atatcgtcag gttggacagg aaataaaaga gggagccttc 2820 
tactatgtaa aaggaaaaat acaatcacgt gatggccgtc tgcaaatgat tgcacaagaa 2880 
ataagagaag cagttgctga acgcttttgg atacaggtga aaaatcatga atcggatcaa 2940 
gaaatttcac gtattttaga acaatttaaa ggcccaatcc cagtcatcat ccggtatgaa 3000 
gaggaacaga aaaccatcgt ttctccccat cattttgtag ctaaatccaa tgaattagag 3060 
gagaaattga atgaaatcgt tatgaaaacg atttatcgct aa 3102 

<210> 97 
<211> 921 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 97 

atgaccaacg aatttttaca ttttgaaaaa atcagccgcc agacttggca atctttacat 60 
cgaaagacaa cacctccttt gacagaagaa gaattggaat ctatcaagag ttttaatgac 120 
caaatcagtc tccaagacgt tacagatatc tatctcccct tggctcatct gattcagatt 180 
tacaagcgaa ctaaggaaga tttagccttt tcaaaaggaa ttttcctcca acgtgaaagt 240 
aaatctcaac cttttattat tggggtttct gggagtgttg ccgttggaaa atccacaacc 300 
agtcgcctac ttcaaatcct actgtcccgt acgtttacag atgctacggt tgagttggtt 360 
acaactgatg gttttctcta tcccaatcaa accttgattg agcagggaat tttaaatcgt 420 
aaaggatttc ctgaaagcta tgatatggaa gctcttctca acttcttgga ccgcatcaaa 480 
aatggacaag atgtagatat tcctgtctat tctcatgaag tttacgacat cgtacccaaa 540 
aagaaacaaa gtgtcaaagc tgctgatttt gtaatcgttg agggaattaa tgtctttcaa 600 
aatccacaaa acgatcgtct ctatatcact gacttctttg acttttccat ctatgtagat 660 
gctggagtgg atgatattga aagttggtat ctggaccgtt tcttgaaaat gctgagtcta 720 
gcccaaaacg accctgatag ctactattat cgttttactc agatgccgat tggggaagtg 780 
gagtcctttg cccatcaggt ctggaccagt atcaatctca caaatctgca aaattatatt 840 
gaaccaacca gaaatcgtgc agaagtgatt cttcataaaa gcaagaacca tgaaatcgat 900 
gaaatttact taaaaaagta 921 



<210> 98 
<211> 741 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 98 

atggaaattt cattattaac agatgttggt cagaaacgaa caaataacca agactatgtc 60 
aaccactatg tcaatagagc tggacgtacc atgattattt tagctgatgg gatgggaggt 120 
catcgtgcag ggaatatcgc tagtgaaatg gcggtcacag acctgggtgt agcttgggtt 180 
gatacccaga tcgatacagt caatgaagtg cgtgaatggt tcgcccatta cctagaaatt 240 
gaaaatcaaa agattcacca gcttggtcag gatgaagctt acagaggcat gggaactact 300 
ttggaagtcc ttgctattat tgataatcag gctatctatg ctcatattgg tgattcgcgt 360 
atcggcttga ttcgtggaga agaataccat cagttgacga gcgatcattc cttggttaat 420 
gaattgctca aggctggtca attgacacca gaagaggcag aagctcatcc gcaaaaaaat 480 
attatcaccc agtctattgg gcaaaaagat gaaattcagc ctgattttgg gacagttatc 540 
cttgagtcag gtgactatct cttgctcaat agtgacggct tgaccaacat gatttcaggc 600 
agtgagattc gtgatattgt aaccagtgat attcctttag cagataaaac ggagacactt 660 
gttcgttttg ctaacaatgc aggaggttta gacaacatta cggttgccct tgtttctatg 720 
aacgaggagg atgaagaatg 741 

<210> 99 
<211> 831 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 99 

gtgacgatac agatgaagaa tacaggtaaa cgaattgatc tgatagccaa tagaaaaccg 60 
cagagtcaaa gggtcttgta tgaattgcga gatcgtttga agagaaatca gtttatactc 120 
aatgatacca atccggatat tgtcatttcc attggcgggg atggtatgct cttgtcggcc 180 
tttcataagt acgaaaatca gcttgacaag gtccgcttta tcggtcttca tactggacat 240 
ttgggcttct atacagatta tcgtgatttt gagttggaca agctagtgac taatttgcag 300 
ctagatactg gggcaagggt ttcttaccct gttctgaatg tgaaggtctt tcttgaaaat 360 
ggtgaagtta agattttcag agcactcaac gaagccagca tccgcaggtc tgatcgaacc 420 
atggtggcag atattgtaat aaatggtgtt ccctttgaac gttttcgtgg agacgggcta 480 
acagtttcga caccgactgg tagtactgcc tataacaagt ctcttggcgg tgctgtttta 540 
caccctacca ttgaagcttt gcaattaacg gaaattgcca gccttaataa tcgtgtctat 600 
cgaacactgg gctcttccat tattgtgcct aagaaggata agattgaact tattccaaca 660 
agaaacgatt atcatactat ttcggttgac aatagcgttt attctttccg taatattgag 720 
cgtattgagt atcaaatcga ccatcataag attcactttg tcgcgactcc tagccatacc 780 
agtttctgga accgtgttaa ggatgccttt atcggtgagg tggatgaatg a 831 

<210> 100 
<211> 1623 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 100 

atgtcaaaag aaattaaatt ttcatcagat gcccgttcag ccatggttcg tggtgtcgat 60 
attcttgcag acactgttaa agtaaccttg ggaccaaaag gtcgcaatgt cgttcttgaa 120 
aagtcattcg gttcaccctt gattaccaat gacggtgtga ccattgccaa agaaatcgaa 180 
ttggaagacc attttgaaaa tatgggtgct aagttagtat cagaagtagc ttctaaaacc 240 
aatgatatcg caggtgacgg aactacgact gcaacagtct tgacccaagc tatcgtccgt 300 
gaaggaatca aaaacgtcac agcaggtgca aatccaatcg gtattcgtcg tgggattgaa 360 
acagcagttg ccgcagcagt tgaagctttg aaaaacaacg ccatccctgt tgccaataaa 420 
gaagctatcg ctcaagttgc agcagtatct tctcgttctg aaaaagttgg tgagtacatc 480 
tctgaagcaa tggaaaaagt tggcaaagac ggtgtcatca ccatcgaaga gtcacgtggt 540 
atggaaacag agcttgaagt cgtagaagga atgcagtttg accgtggtta cctttcacag 600 
tacatggtga cagatagcga aaaaatggtg gctgaccttg aaaatccgta cattttgatt 660 
acagacaaga aaatttccaa catccaagaa atcttgccac ttttggaaag cattctccaa 720 
agcaatcgtc cactcttgat tattgcggat gatgtggatg gtgaggctct tccaactctt 780 
gttttgaaca agattcgtgg aaccttcaac gtagtagcag tcaaggcacc tggttttggt 840 
gaccgtcgca aagccatgct tgaagatatc gccatcttaa caggcggaac agttatcaca 900 
gaagaccttg gtcttgagtt gaaagatgcg acaattgaag ctcttggtca agcagcgaga 960 
gtgaccgtgg acaaagatag cacggttatt gtagaaggtg caggaaatcc tgaagcgatt 1020 
tctcaccgtg ttgcggttat caagtctcaa atcgaaacta caacttctga atttgaccgt 1080 
gaaaaattgc aagaacgctt ggccaaattg tcaggtggtg tagcggttat taaggtcgga 1140 
gccgcaactg aaactgagtt gaaagaaatg aaactccgca ttgaagatgc cctcaacgct 1200 
actcgtgcag ctgttgaaga aggtattgtt gcaggtggtg gaacagctct tgccaatgtg 1260 
attccagctg ttgctacctt ggaattgaca ggagatgaag caacaggacg taatattgtt 1320 
ctccgtgctt tggaagaacc cgttcgtcaa attgctcaca atgcaggatt tgaaggatct 1380 
atcgttatcg atcgtttgaa aaatgctgag cttggtatag gattcaacgc agcaactggc 1440 
gagtgggtta acatgattga tcaaggtatc attgatccag ttaaagtgag tcgttcagcc 1500 
ctacaaaatg cagcatctgt agccagcttg attttgacaa cagaagcagt cgtagccaat 1560 
aaaccagaac cagtagcccc agctccagca atggatccaa gtatgatggg cgggatgatg 1620 
taa 1623 

<210> 101 
<211> 1446 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 101 

atgattaaga ttgaaaccgt attagatatt ttaaagaaag atggcctttt tcgcgaaatt 60 
attgaccaag gtcattacca ctacaactac agcaaagtta tttttgatag catcagctac 120 
gacagccgaa aagtaacaga agacactctt ttttttgcaa aaggcgctgc ctttaaaaaa 180 
gaataccttc tttctgctat aacacaaggt ttagcttggt atgtagctga aaaggactac 240 
gaagtcgata tccctgtcat cattgtgaac gatataaaga aagccatgag tttgattgcc 300 
atggagttct atggtaatcc acaagagaaa ctcaaactcc ttgcctttac tggtactaag 360 
ggtaagacaa cagcaaccta tttcgcctat aacatcttat ctcaggggca tagacctgct 420 
atgttgtcga ccatgaacac aactcttgat ggcgagactt tctttaagtc agcgttgaca 480 
acccctgaga gtattgacct ctttgacatg atgaatcagg ctgtgctaaa tgaccgtacc 540 
cacctcatca tggaagtctc cagtqaagcc tatctagtcc atcgagtqta tggactgacc 600 
tttgatgtag gagtctttct taacatcact cctgaccata tcggcccgat tgaacaccct 660 
agctttgaag actatttcta ccacaagcgt ctcttgatgg aaaatagccg agcagtcatc 720 
attaacagtg acatggacca cttctcagtc ttgaaagaac aggttgaaga tcaagaccat 780 
gatttctatg gtagccaatt tgataaccaa atcgagaatt ccaaagcctt tagcttttca 840 
gctacgggta aactcgctgg agattatgat atccaactca ttggcaactt caaccaagaa 900 
aatgcagttg ctgctggact tgcttgtctc cgtctcggag caagtcttga ggacatcaaa 960 
aaaggcatcg ctgcaacccg cgttcctggt cgtatggaag tcctcactca gaaaaatgga 1020 
gccaaggtct tcatcgacta tgcccacaat ggggatagtc tgaaaaaact catcaatgtg 1080 
gttgaaactc atcaaaccgg aaagattgct ctggttctgg gatcaacagg aaacaaggga 1140 
gaaagtcgtc gtaaggactt tggcctcctc ctcaatcaac accctgagat tcaagtcttt 1200 
ctgactgctg atgaccctaa ctatgaagac ccaatggcca ttgcagatga aattagtagc 1260 
tacatcaatc atcctgttga aaagattgcg gatcgccaag aagccatcaa ggcggcaatg 1320 
gctatcacaa atcacgaatt agatgcagtt attattgcgg gtaagggagc cgattgttac 1380 
caaatcatcc agggcaagaa agaatcctac ccaggagata cagccgtcgc agaaaattat 1440 
ttataa 1446 

<210> 102 
<211> 1980 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 102 

atgatccaaa tcggcaagat ttttgccgga cgctatcgga ttgtcaaaca gattggtcga 60 
ggaggtatgg cggatgtcta cctagccaaa gacttaatct tagatgggga agaagtggca 120 
gtgaaggttc tgaggaccaa ctaccagacg gacccgatag ctgtagctcg ttttcagcgt 180 
gaagcgagag ctatggcaga tctagaccat cctcatatcg ttcggataac agatattggc 240 
gaggaagacg gtcaacagta cctagctatg gagtatgtgg ctggactgga cctcaaacgc 300 
tatatcaagg aacattatcc tctttctaat gaagaagcag tccgtatcat gggacaaatt 360 
ctcttggcta tgcgcttggc ccatactcga ggaattgttc acagggactt gaaacctcaa 420 
aatatcctct tgacaccaga tgggactgcc aaggtcacag actttgggat tgctgtagcc 480 
tttgcagaga caagtctgac ccagactaac tcgatgttgg gctcagttca ttacttgtca 540 
ccagagcagg cgcgtggttc gaaggcgact gtgcagagtg atatctatgc catggggatt 600 
attttctatg agatgctgac aggccatatc ccttatgacg gggatagcgc ggtgaccatt 660 
gccctccagc atttccagaa acccctgccg tccgttattg cagaaaatcc atctgtacct 720 
caggctttag aaaatgttat tatcaaggca actgctaaaa agttgaccaa tcgctaccgc 780 
tcggtttcag agatgtatgt ggacttgtct agtagcttgt cctacaatcg tagaaatgaa 840 
agtaagttaa tctttgatga aacgagcaag gcagatacca agaccttgcc gaaggtttct 900 
cagagtacct tgacatctat tcctaaggtt caagcgcaaa cagaacacaa atcaatcaaa 960 
aacccaagcc aggctgtgac agaggaaact taccaaccac aagcaccgaa aaaacataga 1020 
tttaagatgc gttacctgat tttgttggcc agccttgtat tggtggcagc ttctcttatt 1080 
tggatactat ccagaactcc tgcaaccatt gccattccag atgtggcagg tcagacagtt 1140 
gcagaggcca aggcaacgct caaaaaagcc aattttgaga ttggtgagga gaagacagag 1200 
gctagtgaaa aggtggaaga agggcggatt atccgtacag atcctggcgc tggaactggt 1260 
cgaaaagaag gaacgaaaat caatttggtt gtctcatcag gcaagcaatc tttccaaatt 1320 
agtaattatg tcggtcggaa atcctctgat gtcattgcgg aattaaaaga gaaaaaagtt 1380 
ccagataatt tgattaaaat tgaggaagaa gagtcgaatg agagtgaggc tggaacggtc 1440 

ctgaagcaaa gtctaccaga aggtacgacc tatgacttga gcaaggcaac tcaaattgtt 1500 

ttgacagtag ctaaaaaagc tacgacgatt caattaggga actatattgg acggaactct 1560 

acagaagtaa tctcagaact caagcagaag aaggttcctg agaatttgat taagatagag 1620 

gaagaagagt ccagcgaaag cgaaccagga acgattatga aacaaagtcc aggtgccgga 1680 

acgacttatg atgtgagtaa acctactcaa attgtcttga cagtagctaa aaaagttaca 1740 

agtgttgcca tgccgagtta cattggttct agcttggagt ttactaagaa caatttgatt 1800 

caaattgttg ggattaagga agctaatata gaagttgtag aagtgacgac agcgcctgca 1860 

ggtagtgcag aaggcatggt tgttgaacaa agtcctagag caggtgaaaa ggtagacctc 1920 

aataagacta gagtcaagat ttcaatctac aaacctaaaa caacttcagc tactccttaa 1980 

<210> 103 
<211> 1176 
<212> DNA 
<213> Streptococcus pneumoniae 

<400> 103 

atgaaacatt ttgatactat tgtcatcggt gggggacctg ctggtatgat ggctacgatt 60 
tccagtaact tttatggaca gaaaaccctc ctcatcgaaa aaaatcggaa acttggaaaa 120 

aaattagctg ggactggtgg gggacgttgc aatgtgacca acaatggtag cttagacaac 180 
ctgctagctg gaattcctgg aaacggacgc tttctttaca gtgttttctc ccagttcgat 240 
aatcatgaca tcatcaactt ttttacagaa aatggtgtta aacttaaggt cgaagaccac 300 
ggacgcgtct ttccagccag tgacaagtct cggactatta tcgaagcttt ggaaaagaaa 360 
atcactgaac taggtggtca agttgctact caaatagaaa tcgtttctgt taaaaaagta 420 

gatgaccagt ttgtccttaa gtcagcggat caaaccttca cttgtgagaa actcattgtc 480
acaacaggtg gtaagtctta tccttcgact ggttcgactg gttttggtca cgagattgct 540 
cgccatttta agcataccat caccgatctt gaggctgctg aaagtccttt attaacagat 600 
tttccacata aagccttaca agggatttct ctggacgatg tgaccctaag ttatggtaag 660 
catgtcatca ctcatgattt actctttacc cactttggtt tgtcaggtcc tgctgcccta 720 
cgcatgtcta gctttgtcaa aggtggggag gttctctcac tcgatgtttt gcctcaactt 780
tctgagaagg acttggttac atttctagaa gaaaatcggg aaaaatcctt gaaaaacgct 840 
ttaaaaacct tgttaccaga acgcttggcc gaattttttg tacaaggata tcctgaaaaa 900 
gtcaaacagc tgactgaaaa ggaacgagaa caacttgtcc agtccattaa agaacttaaa 960 
attcctgtaa ctggaaaaat gtcccttgca aagtcctttg ttaccaaggg tggagtcagt 1020 

ctcaaggaaa tcaatcctaa aacccttgaa agtaagctgg tacctggcct ccactttgca 1080
ggcgaagtta tggatatcaa tgcccacacg ggtggcttta acatcacttc tgccctctgt 1140
accggctggg tggcgggaag tctgcattat gattaa 1176

<210> 104 
<211> 696
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 104 

atgctgaaat gggaagactt gcctgtggaa atgaaatcaa gcgaggttga gtcttactac 60
cagcttgtct ctaaaaggaa gggttcgctg attttcaagc gttgcttgga ctgggttttg 120
gccttggtgc ttacatgggt tctaacttct cccatctttc tcatcttgag catttggatc 180 
aagttggata gcaaggggcc agtgatttac aagcaagagc gtgtgaccca gtacaaccgt 240 
cggttcaaga tttggaagtt ccgtaccatg gtgacggatg cggataaaaa aggaagtctg 300 

gtgacttctg ctaacgatag ccgtattacc aaggttggaa atttcatccg acgtgtccgt 360
ttggacgaac tgcctcagtt ggtcaatgtc cttaaaggtg agatgtcctt tgtcggtaca 420
cgacctgaag tgccacgtta tacagagcag tatagccctg aaatgatggc aaccttgctc 480
ttgcaagcag gaattacctc tccagccagc atcaactaca aggatgagga caccatcatc 540 
agtcaaatga cggagaaagg tctgtcagtt gatcaggcct atgtggagca tgttcttcct 600 

gaaaagatgc gctataacct cgcctatctc cgagagttta gtttctttgg ggacatcaaa 660
atcatgtttc aaaccgtgtt tgaggtacta aaataa 696

<210> 105
<211> 423 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 105 

atgactagtc cactattaga atctagacgc caactccgta aatgcgcttt tcaagctctc 60
atgagccttg agttcggtac ggatgtcgaa actgcttgtc gtttcgccta tactcatgat 120
cgtgaatata cggatgtaca acttccagcc tttttgatag acctcgtttc tggtgttcaa 180
gctaaaaagg aagaactaga taagcaaatc actcagcatt taaaagcagg ttggaccatt 240
gaacgcttaa cgctcgtgga gagaaacctc cttcgcttgg gagtctttga aatcacttca 300
tttgacactc ctcagctggt tgctgttaat gaagctatcg agcttgcaaa ggacttctcc 360
gatcaaaaat ctgcccgttt tatcaatgga ctgctcagcc agtttgtaac agaagaacaa 420
taa 423

<210> 106
<211> 3540
<212> DNA

<213> Streptococcus pneumoniae
<400> 106

atgtatttaa aggaaatcga aattcagggg ttcaagtctt ttgctgataa gaccaaggtc 60
gtttttgacc aaggtgtgac ggcagttgtt ggacccaatg gatctggaaa gtccaatatt 120
acagaaagtc tgcgttgggc tttgggggag tctagtgtca agagtctccg tgggggcaag 180
atgccggatg tcatctttgc tggaacagaa agtcgcaaac cgctcaatta tgcttctgta 240
gttgtgactc tggataatca tgacggattt atcaaggatg caggtcaaga aatcagggtg 300
gaacgccata tctatcgtag tggagatagc gaatacaaga ttgacggcaa gaaagtccgt 360
ctgcgtgata ttcatgacct cttcttggat actggattgg gacgagattc cttctctatt 420
atttcccaag ggaaggttga ggagattttt aattccaagc ctgaggaacg acgagctatt 480
tttgaagaag ctgctggagt tttaaaatac aagactcgca gaaaagaaac cgagagtaaa 540
ctgcagca£a ctcaggataa tctggaccgc ttagaggaca ttatctacga gttggataat 600
caaatcaagc ctcttgagaa gcaagctgag aatgcccgta agtttttaga cttggaagga 660
caacgtaagg ctatttattt agacgttctg gttgctcaaa tcaaggaaaa taaggcagaa 720
ctagagtcga cagaagaaga gttggctcag gttcaagaac tcttgatgag ttattaccaa 780
aagcgtgaaa aattagaaga agaaaatcaa actcttaaaa agcaacgcca agatttacag 840
gctgaaatgg ccaaagacca aggcagtttg atggacttga ctagtctgat tagtgattta 900
gaaagaaaat tagccctatc gaaactggag tccgagcaag tggccctgaa tcaacaggag 960
gcacaagctc gtttggctgc tttggaggat aagagaaatt cactcagcaa agaaaagtat 1020
gataaagaaa gctctttagc tctgttagag ggaaatctag tccaaaataa tcaaaaactc 1080
aatcgtttag aagctgaatt gctggctttc tcagacgatc ctgatcagat gattgagctc 1140
ttacgtgaac gctttgtagc tcttttacaa gaagaagcgg atgtctcaaa ccagttgacc 1200
cgtattgaga atgagttgga aaatagtcgt cagctttctc aaaaacaagc agatcaacta 1260
gaaaagctga aagagcaatt agctacagct aaagagaagg ctagtcagca aaaagacgag 1320
cttgaaactg ccaaggtgca ggttcagaaa ttattggctg actatcaagc tattgccaag 1380
gagcaagagg agcagaaaac ttcctatcaa gctcaacaaa gtcaactctt tgaccgtctg 1440
gatagtctca aaaacaagca ggccagagct caaagtttgg aaaatatcct gagaaatcat 1500
agtaactttt atgcaggtgt taagagtgtt ctccaagaaa aagatcgcct aggtgggatt 1560
attggtgcag tcagtgagca tctgaccttt gatgtttatt atcaaactgc cctagagatt 1620
gccttagggg caagtagcca gcatatcatc gtagaagatg aagagtcggc aaccaaagct 1680
attgatttcc tcaaacgaaa cagagtcggt cgtgcaacct ttcttccttt gaccactatt 1740
aaggcgcgta cgatttctag tcagaaccaa gatgctatcg ctgtaagccc aggtttcctt 1800
gggatggcag atgagttggt gacttttgat actagactgg aagccatttt caagaacttg 1860
ctagctacga cggctatctt tgataccgta gaacatgcgc gtgaagctgc tcgacaagtt 1920
cgttatcagg ttcgtatggt gacattggat gggacagaat tacgcacggg tggttcctat 1980
gcgggtggtg ccaatcgcca aaataacagt attttcatca agccagaact ggagcaatta 2040
caaaaagaaa ttgctgcaga tgaagcaagc ttgggttcag aagaagcggc tttgaagacc 2100
ttgcaagacc agatggctgc attgacagaa agattagaag ccatcaaatc tcaaggagag 2160
caggcacgta ttcaggagca aggcttgtcc ctcgcttatc agcaaactag tcagcaagtt 2220
gaagaactgg aaactctttg gaaactccaa gaagaggaaa tagatcgtct ttctgaggga 2280 
gattggcaag cggataagga aaaatgtcaa gagagccttg ctactatcgc cagtgacaag 2340 
caaaatctgg aagctgagat tgaagaaatt aagtctaata aaaacgccat ccaagaacgc 2400 
tatcaaaatt tgcaggaaga ggtggcgcaa gctcgcttgc ttaagacaaa actgcaaggg 2460
caaaaacgtt atgaagtagc tgatattgag cgtttaggca aggaattgga caatcttaat 2520
atcgaacaag aagaaattca gcgcatgctc caagaaaaag ttgacaatct tgagaaggtt 2580
gatacagaat tgctcagtca acaggcggaa gaatccaaaa ctcagaaaac aaatctccaa 2640
caaggtttga ttcgcaagca gtttgagttg gatgatatag aaggtcaact ggatgatatt 2700

gccagtcact tggatcaagc tcgccagcag aatgaggagt ggattcgcaa gcaaacacgt 2760
gctgaagcca agaaagaaaa ggtcagcgag cgcttgcgcc atctacaaaa tcaattaaca 2820 
gaccagtacc agattagcta tactgaagca ctagaaaagg cacatgaatt ggaaaacctc 2880 
aatctggcag agcaagaggt gcaggattta gagaaggcta ttcgctcatt gggacctgtc 2940 
aacttggaag ctattgacca gtacgaagaa gttcacaacc gtctggactt tctaaatagt 3000 

cagcgagatg atattttgtc agcgaaaaat ctgctccttg aaaccattac agagatgaat 3060
gatgaggtca aggaacgctt taaatcaacc tttgaagcta ttcgtgagtc ctttaaagtg 3120
accttcaagc agatgtttgg cggaggtcag gcagacttga tattgactga gggcgacctt 3180
ttaacagctg gtgtggagat ttctgttcaa cctccaggta agaaaatcca gtcgcttaac 3240
ctcatgagtg gtggtgaaaa agctctatcg gctcttgcct tgcttttctc cattattcgt 3300

gtcaagacca ttccttttgt catcttggat gaggtggaag ctgcgctgga tgaagccaat 3360
gttaaacgtt ttggggatta cctcaaccgc tttgacaagg acagccagtt tatcgtcgta 3420 
acccaccgta agggaaccat ggcagcggct gattccatct atggagtgac catgcaagaa 3480 
tcaggtgtct caaaaattgt ttcggttaag ttaaaagatt tagaaagtat tgaaggatga 3540 

<210> 107
<211> 1344 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 107 

atgacaaaac gtgtaacgat tattgacgta aaagactatg ttggtcagga agtgacgatt 60 
ggcgcttggg ttgccaacaa atcaggaaaa ggaaaaatcg ctttcttaca attgcgtgat 120 
ggaacagcct tctttcaagg tgtggctttt aaaccaaact ttgtcgaaaa atttggtgaa 180 
gaagtgggac ttgagaagtt tgatgttatc aaacgcttga gccaagaaac gtctgtttat 240 
gtgacaggta ttgtcaaaga ggacgaacgt tctaaatttg gctatgagtt ggacatcaca 300 
gacatcgaag tgatcggtga atctcaagac tacccaatca caccaaaaga acacggaaca 360 
gactttttga tggataaccg tcacttgtgg ctacgctctc gtaagcaagt agctgtgttg 420 
caaatccgta acgctattat ctatgcaact tatgagttct ttgacaagaa cggttttatg 480 
aagtttgaca gcccaattct ttcaggaaat gcggcagaag attctacaga actctttgaa 540 
actgactact tcggaacgcc agcctacttg agccaatcag gtcagcttta cctagaagca 600 
ggggctatgg ctcttggtcg tgtctttgac tttggtccag ttttccgtgc tgaaaaatca 660
aaaacacgcc gtcacttgac tgagttctgg atgatggatg ctgagtactc atacttgaca 720
catgatgagt cgcttgactt gcaagaagct tatgtgaaag ctcttctaca aggtgttctt 780 
gaccgcgcgc ctcaagcctt ggaaaccttg gaacgtgata cagaactctt gaaacgctac 840 
attgcagagc cattcaaacg tatcacttac gatcaagcca ttgacctctt gcaagagcat 900 
gaaaatgatg aagatgctga ctacgagcat cttgagcatg gtgatgactt tgggtcacca 960 
cacgaaactt ggatttcaaa ccactttggt gtgccaacat ttgtcatgaa ctatccagca 1020 
gccatcaagg ccttctacat gaaaccagtt cctggaaatc cagagcgcgt gctttgtgca 1080 
gacttgcttg ctccagaagg ctatggagaa attatcggtg ggtctatgcg tgaggaagat 1140
tacgatgccc ttgtcgctaa gatggatgaa cttggcatgg atcgtacaga atatgaattc 1200 
taccttgacc ttcgtaaata cggtacagtt ccacacggag gatttggtat cggtatcgaa 1260 
cgtatggtaa ccttcgcagc aggaacaaaa catatccgtg aagctattcc attcccacgt 1320 
atgttgcacc gtatcaaacc ataa 1344

<210> 108
<211> 927
<212> DNA
<213> Streptococcus pneumoniae
<400> 108 

atgtctgaaa aattagtaga aatcaaagat ttagaaattt ccttcggtga aggaagtaag 60 
aagtttgtcg cggttaaaaa tgctaacttc tttatcaaca agggagaaac tttctcgctt 120
gtaggtgagt ccggtagtgg gaaaacaact attggtcgtg ctatcatcgg tctaaatgat 180 
acaagtaatg gagatatcat ttttgatggt caaaagatta atggtaagaa atcgcgtgaa 240 
caagctgcgg aattgattcg tcgaatccag atgattttcc aagaccctgc cgcaagtttg 300 
aatgaacgtg cgactgttga ttatattatt tctgaaggtc tttacaatca ccgtttattt 360 

aaggatgaag aagaacgtaa agagaaagtt caaaatatta tccgtgaagt aggtcttctt 420
gctgagcact tgactcgtta ccctcatgaa ttctcaggcg gtcaacgtca acgtatcggt 480 
attgcccgtg ccttggtcat gcaaccagac tttgttattg cagatgagcc aatttcagcc 540 
ttggacgttt ctgtacgtgc ccaagtcttg aacttgctca aaaaattcca aaaagagctc 600 
ggcctgacct atctcttcat cgcccatgac ttgtcggttg ttcgctttat ttcagatcgt 660 

atcgcagtta tttacaaggg tgttattgta gaggttgcag aaacagaaga attgtttaac 720
aatccaattc acccatatac tcaagccttg ctttcagcgg taccaatccc agatccaatc 780 
ttggaacgta agaaggtctt gaaggtttac gacccaagtc aacacgacta tgagactgat 840 
aagccgtcta tggtagaaat ccgtccaggt cactatgttt gggcgaacca aaccgaattg 900 
gcacgttatc aaaaaggact aaactag 927

<210> 109
<211> 1275
<212> DNA

<213> Streptococcus pneumoniae

<400> 109 

atgaagataa gttggaatgg attttctaaa aaatcatacc aagagcgcct cgagctgcta 60 
aaagctcagg cgctccttag tcctgagaga caagctagtc tggagaagga tgaacagatg 120 
agtgtgactg tggcagacca gctgagtgag aatgtggtgg gaactttttc tctgccttat 180 

tcgctggttc cggaggtact tgtcaacggt caggaataca ccgttcccta tgtgacagaa 240
gaaccctctg tggttgcggc ggccagctat gccagcaaaa tcatcaagcg tgcaggtggt 300
tttactgcac aagtccatca gcgccagatg attgggcagg tagcccttta tcaaattgct 360
aatcctaaac tagcgcaaga gaagattgcc agcaagaaag cggagctctt ggagcttgcc 420
aatcaagcct atccttctat cgttaaacgt gggggtgggg cgcgtgatct gcatgtcgag 480

cagataaaag gcgaaccaga ctttctcgtt gtttatattc atgtcgatac ccaggaagcc 540
atgggtgcca atatgctcaa caccatgctg gaagccttga aaccagtctt agaagaactc 600 
agtcagggac agagtctcat gggaatcctg tccaactacg cgactgattc tctggtgact 660 
gcaagctgtc gcatcgcctt tcgctacttg agccgccaaa aggatcaagg acgagagatt 720 
gcggagaaaa ttgcgttggc tagtcagttt gcgcaggctg atccttaccg agctgctact 780 

cataataaag gaatttttaa tggtattgat gcgattttga ttgccactgg taatgactgg 840
cgtgccatcg aagctggggc ccatgccttt gccagtcgag atggacgcta tcaaggtctt 900
agctgctgga cgctggacct tgaaagagaa gaattggtcg gtgagatgac cctgcccatg 960
cctgtagcga ctaagggtgg ctctatcggc ctcaacccac gtgtagctct cagtcatgat 1020
ctactaggaa atccttctgc cagagaatta gcccagatta tcgtgtccat cggtcttgct 1080

caaaattttg cagccctcaa agccttggta agtacgggca tccagcaagg ccacatgaaa 1140
ctacaggcca aatccctagc tctcctagct ggggctagtg aatctgaagt tgctccccta 1200
gtagagcgcc tcatctcaga taaaaccttt aacctagaga cagcccagcg ctatctcgaa 1260
aatttaagat cataa 1275

<210> 110
<211> 789 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 110

atgccaatta catcattaga aataaaggac aagacttttg gaactcgatt cagaggtttt 60
gatccagaag aagtcgatga atttttagat attgtggttc gtgattacga agatcttgtg 120
cgtgcgaatc atgataaaaa tttgcgtatt aagagtttag aagagcgttt gtcttacttt 180
gatgaaataa aagattcatt gagccagtct gtattgattg ctcaggatac agctgagaga 240
gtgaaacagg cggcgcatga acgttcaaac aatatcattc atcaagcaga gcaagatgcg 300
caacgcttgt tggaagaagc taaatataag gcaaacgaga ttcttcgtca agcaactgat 360
aatgctaaga aagtcgctgt tgaaacagaa gaattgaaga acaagagccg tgtcttccac 420
caacgtctca aatctacaat tgagagtcag ttggctattg ttgaatcttc agattgggaa 480
gatattctcc gtccaacagc tacttatctt caaaccagtg atgaagcctt taaagaagtg 540
gttagcgaag tacttggaga accgattcca gctccaattg aagaagaacc aattgatatg 600
acacgtcagt tctctcaagc agaaatggca gaattacaag ctcgtattga ggtagccgat 660
aaagaattgt ctgaatttga agctcagatt aaacaggaag tggaagctcc aactcctgta 720
gtgagtcctc aagttgaaga agagcctctg ctcatccagt tggcccaatg tatgaagaac 780
cagaagtag 789

<210> 111 
<211> 1728
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 111 

atgtctaatg gacaactaat ttatttaatg gttgcaattg cagtcatttt agttctggct 60
tatgtagtgg caatctttct acgtaagcga aacgagggga gattagaggc gctagaagaa 120 
agaaaagaag aactatacaa tcttccagta aatgatgaag tagaagctgt aaaaaatatg 180 
cacttgattg gacaaagtca agtggctttc cgtgaatgga atcaaaaatg ggtcgattta 240 
tctctcaact cttttgccga tattgaaaat aatctctttg aagcagaagg ttataaccat 300 

tcatttcgtt ttctcaaggc cagtcatcaa attgaccaaa ttgagagtca aattactttg 360
attgaagaag atattgcggc aattcgcaat gctttggcag acttagagaa gcaagaatct 420
aaaaatagtg gtcgtgttct tcatgctttg gatttatttg aggaacttca gcatagagtt 480
gctgaaaatt cagaacagta tggtcaagcc ttggatgaaa ttgaaaaaca attagaaaat 540
atccaatctg aattttcaca atttgtaacc ttgaattcat cgggtgaccc tgtggaagcc 600

gcagtgattt tggataatac agaaaatcac attttggcct taagtcatat tgtggatcgt 660
gttccagcct tggttacgac gctttctaca gaattgccag atcaattaca ggatttggaa 720 
gccggttatc gtaaactaat tgatgctaat tatcattttg ttgaaacgga tattgaagcg 780 
cgtttccact tgctttatga agcattcaag aaaaaccaag agaatattcg tcagttggaa 840 
ttggataatg ccgaatatga gaatggacag gcacaagagg aaatcaatgc cttgtatgat 900 

atttttactc gagaaattgc tgctcagaaa gtagtggaaa atctacttgc aactcttcca 960
acttaccttc aacatatgaa agagaataat actttattgg gagaagatat tgcacgtttg 1020
aacaagacct atttacttcc tgagacagct gcaagccatg ttcgtcgtat tcagacagaa 1080
ttagagagtt ttgaggcagc tattgttgag gtaacttcaa atcaagaaga accaacccaa 1140
gcttattcag ttcttgaaga aaatcttgag gatttacaaa ctcaactaaa agatattgaa 1200
gatgagcaaa tttcagttag tgagcgcctg acacaaattg agaaagatga tattaatgca 1260
cgtcaaaagg ccaatgttta tgtcaatcgt ctccatacta tcaagcgata catggaaaaa 1320
cgcaatctgc caggtattcc acaaactttc ttgaagttat tctttacggc aagcaataat 1380
accgaggatt taatggttga gttagaacaa aaaatgatta acattgaatc tgttacccga 1440
gttcttgaaa ttgcaacgaa tgatatggaa gctttagaaa cggaaactta taatattgta 1500
caatatgcaa ctttgacaga gcaactcttg caatattcta accgctatcg ctcatttgat 1560
gaacgcattc aagaagcatt taacgaagct ttagatattt ttgaaaaaga atttgattat 1620 
cacgcttcat ttgataagat ttctcaagca ttggaagtgg cagagcctgg tgtaaccaat 1680 
cgctttgtta cctcatatga gaaaacacgt gaaacgattc gtttttaa 1728 

<210> 112
<211> 2403
<212> DNA

<213> Streptococcus pneumoniae
<400> 112

atgcttatat cttataaatg gttaaaagaa ttggtggaca ttgatgtgcc atcacaagag 60 
ttggctgaaa aaatgtcaac tacaggaatc gaggtagagg gtgtcgaatc accagctgct 120
ggtctctcaa aaattgtcgt cggtgaggtc ttgtcttgcg aagatgtgcc agagactcac 180
ctccatgttt gtcaggttaa cgttggcgaa gaagagcgtc agatcgtttg tggtgcccca 240
aatgtgcgtg ctgggatcaa ggtcatggtg gctcttccag gagctcgtat cgctgataac 300
tacaaaatca aaaaaggaaa aatccgtggt ttggagtcac ttggaatgat ctgttcactt 360
ggtgaattgg gaatttctga ctcagttgtg cctaaggaat tcgcagatgg catccaaatc 420
ttgcctgaag atgccgtgcc atgtgaggaa gtcttttctt acctagactt ggatgatgaa 480
atcatcgaac tttccatcac accaaaccgt gcagatgccc tttctatgtg tggagtggct 540
cacgaagtgg cagccatcta tgacaaggca gtcaacttta aagaatttac tctaacagaa 600
actaatgaag ctgcggcaga tgccctttct gtcagcattg agacagacaa ggcgccttac 660
tatgcagctc gtatcttgga caatgtgacc atcgcaccaa gtccacaatg gttgcaaaac 720
cttctcatga acgaaggaat ccgtcccatc aataacgtag tggacgtgac caactacatc 780
ctgctctatt ttggtcaacc aatgcatgcc tttgacttgg ataactttga agggactgac 840
atccgtgtgc gtgaagcgcg tgctggtgaa aaattggtga ccttggacgg tgaagaacgt 900
gacttggacg tgaatgacct agtcatcact gtcgcagaca agccagtagc ccttgcaggt 960
gtcatgggtg gtcaagcaac agaaatctct gaaaaatcta gtcgtgttgt ccttgaagct 1020
gctgttttca atggcaaatc tatccgtaag acaagtggtc gcctgaacct tcgttctgag 1080
tcatcttctc gctttgaaaa aggaattaat gtggcaacag ttaatgaagc ccttgatgcg 1140
gcagctagcc tgattgcgga acttgcaggt gcgacggtgc gtaagggcat cgtttcagcg 1200
ggtgagcttg atacttcaga tgtagaagtt tcttcaaccc ttgctgatgt taaccgtgtc 1260
ctcggaactg agctgtctta tgctgatgta gaagacgtct tccgtcgtct tggctttggt 1320
ctttctggaa atgcagacag ctttacagtc agagtcccac gtcgtcgttg ggatatcaca 1380
atcgaagctg acctctttga agaaattgct cgtatctatg gttatgaccg cttgccaact 1440
agtctaccaa aagacgatgg tacagcaggt gaattgacag ccacacaaaa actccgccgt 1500
caagttcgta ctattgctga aggagcaggt ttgacagaaa tcatcaccta tactctaaca 1560
actcctgaaa aagcagttga gtttacggct caaccaagta accttacgga actcatgtgg 1620
ccaatgactg tggatcgttc agtcctccgt caaaatatga tttcaggtat ccttgatacc 1680
gttgcctaca acgtggctcg taagaataaa aacttggccc tttacgagat tggaaaagtc 1740
tttgaacaaa caggtaatcc aaaagaagaa cttccaaatg aaatcaacag ttttgccttt 1800
gccttgacag gcttggttgc tgaaaaagat ttccaaacag cagcagttcc agttgatttc 1860
ttctatgcta agggaatcct tgaagcccta tttactcgtt tgggactcca agtaacctat 1920
acagcaacat ctgaaatcgc tagccttcat ccaggtcgta cagccgtgat ttcactcggt 1980
gaccaagttc ttggtttcct tggccaagtg catccagtca ctgccaaggc ttacgatatt 2040
ccagaaacgt atgtggctga gcttaacctt tcagctatcg aagctgcgct tcagccagcg 2100
actccatttg tagaaatcac caaattcccg gcagtcagcc gtgacgttgc ccttctcctc 2160
aaggcagaag tgactcatca agaagttgta gatgctatcc aagctgccgg cgtgaaacgt 2220
ttgacagata tcaaactctt tgacgtcttc tcaggtgaga aattgggact tggtatgaag 2280
tcaatggctt atagcttgac cttccaaaat ccagaagata gcttaacgga cgaagaagtc 2340
gcacgctata tggaaaaaat ccaagcatcg ctcgaagaaa aagtcaatgc agaagtgcgt 2400
taa 2403

<210> 113
<211> 543
<212> DNA

<213> Streptococcus pneumoniae
<400> 113

atgttagaaa acgatattaa aaaagtcctc gtttcacacg atgaaattac agaagcagct 60
aaaaaactag gtgctcaatt aactaaagat tatgcaggaa aaaatccaat cttagttggg 120
attttaaaag gatctattcc ttttatggct gaattggtca aacatattga tacacatatt 180
gaaatggact tcatgatggt ttctagctac catggtggaa cagcaagtag tggtgttatc 240
aatattaaac aagatgtgac tcaagatatc aaaggaagac atgttctatt tgtagaagat 300
atcattgata caggtcaaac tttgaagaat ttgcgagata tgtttaaagc aagagaagca 360
gcttctgtta aaattgcaac cttgttggat aaaccagaag gacgtgttgt agaaattgag 420
gcagactata cctgctttac tatcccaaat gagtttgtag taggttatgg tttagactac 480
aaagaaaatt atcgtaatct tccttatatt ggagtattga aagaggaagt gtattcaaat 540
tag 543

<210> 114

<211> 235 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 114 

Met Ile Tyr Ala Gly Ile Leu Ala Gly Gly Thr Gly Thr Arg Met Gly
1 5 10 15

Ile Ser Asn Leu Pro Lys Gln Phe Leu Glu Leu Gly Asp Arg Pro Ile
20 25 30

Leu Ile His Thr Ile Glu Lys Phe Val Leu Glu Pro Ser Ile Glu Lys
35 40 45

Ile Val Val Gly Val His Gly Asp Trp Val Leu His Ala Glu Asp Leu
50 55 60

Val Asp Lys Tyr Leu Pro Leu His Lys Glu Arg Ile Ile Ile Thr Lys
65 70 75 80

Gly Gly Ala Asp Arg Asn Thr Ser Ile Glu Asn Ile Ile Glu Ala Ile
85 90 95

Asp Ala Tyr Arg Pro Leu Thr Pro Glu Asp Ile Val Val Thr His Asp
100 105 110

Ser Val Arg Pro Phe Ile Thr Leu Arg Met Ile Gln Asp Ser Ile Lys
115 120 125

Leu Ala Gln Asn His Asp Ala Val Asp Thr Val Val Glu Ala Val Asp
130 135 140

Thr Ile Val Glu Ser Thr Asn Gly Gln Phe Ile Thr Gly Ile Pro Asn
145 150 155 160

Arg Ala His Leu Tyr Gln Gly Gln Thr Pro Gln Thr Phe Arg Cys Lys
165 170 175

Asp Phe Met Asp Leu Tyr Gly Ser Leu Ser Asp Glu Glu Lys Glu Ile
180 185 190

Leu Thr Asp Ala Cys Lys Ile Phe Val Ile Lys Gly Lys Asp Val Ala
195 200 205

Leu Ala Lys Gly Glu Tyr Ser Asn Leu Lys Ile Thr Thr Val Thr Asp
210 215 220

Leu Lys Ile Ala Lys Ser Met Ile Glu Lys Asp
225 230 235



<210> 115 
<211> 185 
<212> PRT 

<213> Streptococcus pneumoniae
<400> 115

Met Ala Asn Val Ile Ile Glu Lys Ala Lys Glu Arg Met Thr Gln Ser
1 5 10 15

His Gln Ser Leu Ala Arg Glu Phe Gly Gly Ile Arg Ala Gly Arg Ala
20 25 30

Asn Ala Ser Leu Leu Asp Arg Val His Val Glu Tyr Tyr Gly Val Glu
35 40 45

Thr Pro Leu Asn Gln Ile Ala Ser Ile Thr Ile Pro Glu Ala Arg Val
50 55 60

Leu Leu Val Thr Pro Phe Asp Lys Ser Ser Leu Lys Asp Ile Glu Arg
65 70 75 80

Ala Leu Asn Ala Ser Asp Leu Gly Ile Thr Pro Ala Asn Asp Gly Ser
85 90 95

Val Ile Arg Leu Val Ile Pro Ala Leu Thr Glu Glu Thr Arg Arg Asp
100 105 110

Leu Ala Lys Glu Val Lys Lys Val Gly Glu Asn Ala Lys Val Ala Val
115 120 125

Arg Asn Ile Arg Arg Asp Ala Met Asp Glu Ala Lys Lys Gln Glu Lys
130 135 140

Ala Gln Glu Ile Thr Glu Asp Glu Leu Lys Thr Leu Glu Lys Asp Ile
145 150 155 160

Gln Lys Val Thr Asp Asp Ala Val Lys His Ile Asp Asp Met Thr Ala
165 170 175

Asn Lys Glu Lys Glu Leu Leu Glu Val
180 185



<210> 116 
<211> 450 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 116 

Met Gly Lys Tyr Phe Gly Thr Asp Gly Val Arg Gly Glu Ala Asn Leu
1 5 10 15

Glu Leu Thr Pro Glu Leu Ala Phe Lys Leu Gly Arg Phe Gly Gly Tyr
20 25 30

Val Leu Ser Gln His Glu Thr Glu Ala Pro Lys Val Phe Val Gly Arg
35 40 45

Asp Thr Arg Ile Ser Gly Glu Met Leu Glu Ser Ala Leu Val Ala Gly
50 55 60

Leu Leu Ser Val Gly Ile His Val Tyr Lys Leu Gly Val Leu Ala Thr
65 70 75 80

Pro Ala Val Ala Tyr Leu Val Glu Thr Glu Gly Ala Ser Ala Gly Val
85 90 95

Met Ile Ser Ala Ser His Asn Pro Ala Leu Asp Asn Gly Ile Lys Phe
100 105 110

Phe Gly Gly Asp Gly Phe Lys Leu Asp Asp Glu Lys Glu Ala Glu Ile
115 120 125

Glu Ala Leu Leu Asp Ala Glu Glu Asp Thr Leu Pro Arg Pro Ser Ala
130 135 140

Glu Gly Leu Gly Ile Leu Val Asp Tyr Pro Glu Gly Leu Arg Lys Tyr
145 150 155 160

Glu Gly Tyr Arg Val Ser Thr Gly Thr Pro Leu Asp Gly Met Lys Val
165 170 175

Ala Leu Asp Thr Ala Asn Gly Ala Ala Ser Thr Ser Ala Arg Gln Ile
180 185 190

Phe Ala Asp Leu Gly Ala Gln Leu Thr Val Ile Gly Glu Thr Pro Asp
195 200 205

Gly Leu Asn Ile Asn Leu Asn Val Gly Ser Thr His Pro Glu Ala Leu
210 215 220

Gln Glu Val Val Lys Glu Ser Gly Ser Ala Ile Gly Leu Ala Phe Asp
225 230 235 240

Gly Asp Ser Asp Arg Leu Ile Ala Val Asp Glu Asn Gly Asp Ile Val
245 250 255

Asp Gly Asp Lys Ile Met Tyr Ile Ile Gly Lys Tyr Leu Ser Glu Lys
260 265 270

Gly Gln Leu Ala Gln Asn Thr Ile Val Thr Thr Val Met Ser Asn Leu
275 280 285

Gly Phe His Lys Ala Leu Asn Arg Glu Gly Ile Asn Lys Ala Val Thr
290 295 300

Ala Val Gly Asp Arg Tyr Val Val Glu Glu Met Arg Lys Ser Gly Tyr
305 310 315 320

Asn Leu Gly Gly Glu Gln Ser Gly His Val Ile Leu Met Asp Tyr Asn
325 330 335

Thr Thr Gly Asp Gly Gln Leu Ser Ala Val Gln Leu Thr Lys Ile Met
340 345 350

Lys Glu Thr Gly Lys Ser Leu Ser Glu Leu Ala Ala Glu Val Thr Ile
355 360 365

Tyr Pro Gln Lys Leu Val Asn Ile Arg Val Glu Asn Val Met Lys Glu
370 375 380

Lys Ala Met Glu Val Pro Ala Ile Lys Ala Ile Ile Glu Lys Met Glu
385 390 395 400

Glu Glu Met Ala Gly Asn Gly Arg Ile Leu Val Arg Pro Ser Gly Thr
405 410 415

Glu Pro Leu Leu Arg Val Met Ala Glu Ala Pro Thr Thr Glu Glu Val
420 425 430

Asp Tyr Tyr Val Asp Thr Ile Thr Asp Val Val Arg Ala Glu Ile Gly
435 440 445

Ile Asp
450



<210> 117 
<211> 234 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 117 

Met Lys Lys Ile Leu Ile Val Asp Asp Glu Lys Pro Ile Ser Asp Ile
1 5 10 15

Ile Lys Phe Asn Met Thr Lys Glu Gly Tyr Glu Val Val Thr Ala Phe
20 25 30

Asn Gly Arg Glu Ala Leu Glu Gln Phe Glu Ala Glu Gln Pro Asp Ile
35 40 45

Ile Ile Leu Asp Leu Met Leu Pro Glu Ile Asp Gly Leu Glu Val Ala
50 55 60

Lys Thr Ile Arg Lys Thr Ser Ser Val Pro Ile Leu Met Leu Ser Ala
65 70 75 80

Lys Asp Ser Glu Phe Asp Lys Val Ile Gly Leu Glu Leu Gly Ala Asp
85 90 95

Asp Tyr Val Thr Lys Pro Phe Ser Asn Arg Glu Leu Gln Ala Arg Val
100 105 110

Lys Ala Leu Leu Arg Arg Ser Gln Pro Met Pro Val Asp Gly Gln Glu
115 120 125

Ala Asp Ser Lys Pro Gln Pro Ile Gln Ile Gly Asp Leu Glu Ile Val
130 135 140

Pro Asp Ala Tyr Val Ala Lys Lys Tyr Gly Glu Glu Leu Asp Leu Thr
145 150 155 160

His Arg Glu Phe Glu Leu Leu Tyr His Leu Ala Ser His Thr Gly Gln
165 170 175

Val Ile Thr Arg Glu His Leu Leu Glu Thr Val Trp Gly Tyr Asp Tyr
180 185 190

Phe Gly Asp Val Arg Thr Val Asp Val Thr Val Arg Arg Leu Arg Glu
195 200 205

Lys Ile Glu Asp Thr Pro Ser Arg Pro Glu Tyr Ile Leu Thr Arg Arg
210 215 220

Gly Val Gly Tyr Tyr Met Arg Asn Asn Ala
225 230



<210> 118 
<211> 368
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 118 

Met Glu Glu Ile Leu Cys Ile Gly Cys Gly Ala Thr Ile Gln Thr Thr
1 5 10 15

Asp Lys Ala Gly Leu Gly Phe Thr Pro Gln Ser Ala Leu Glu Lys Gly
20 25 30

Leu Glu Thr Gly Glu Val Tyr Cys Gln Arg Cys Phe Arg Leu Arg His
35 40 45

Tyr Asn Glu Ile Thr Asp Val Gln Leu Thr Asn Asp Asp Phe Leu Lys
50 55 60

Leu Leu His Glu Val Gly Asp Ser Asp Ala Leu Val Val Asn Val Ile
65 70 75 80

Asp Ile Phe Asp Phe Asn Gly Ser Val Ile Pro Gly Leu Pro Arg Phe
85 90 95

Val Ser Gly Asn Asp Val Leu Leu Val Gly Asn Lys Lys Asp Ile Leu
100 105 110

Pro Lys Ser Val Lys Ser Gly Lys Ile Ser Gln Trp Leu Met Lys Arg
115 120 125

Ala His Glu Glu Gly Leu Arg Pro Val Asp Val Val Leu Thr Ser Ala
130 135 140

Gln Asn Lys His Ala Ile Lys Glu Val Ile Asp Lys Ile Glu His Tyr
145 150 155 160

Arg Lys Gly Arg Asp Val Tyr Val Val Gly Val Thr Asn Val Gly Lys
165 170 175

Ser Thr Leu Ile Asn Ala Ile Ile Gln Glu Ile Thr Gly Asp Gln Asn
180 185 190

Val Ile Thr Thr Ser Arg Phe Pro Gly Thr Thr Leu Asp Lys Ile Glu
195 200 205

Ile Pro Leu Asp Asp Gly Ser Tyr Ile Tyr Asp Thr Pro Gly Ile Ile
210 215 220

His Arg His Gln Met Ala His Tyr Leu Thr Ala Lys Asn Leu Lys Tyr
225 230 235 240

Val Ser Pro Lys Lys Glu Ile Lys Pro Lys Thr Tyr Gln Leu Asn Pro
245 250 255

Glu Gln Thr Leu Phe Leu Gly Gly Leu Gly Arg Phe Asp Phe Ile Ala
260 265 270

Gly Glu Lys Gln Gly Phe Thr Ala Phe Phe Asp Asn Glu Leu Lys Leu
275 280 285

His Arg Ser Lys Leu Glu Gly Ala Ser Ala Phe Tyr Asp Lys His Leu
290 295 300

Gly Thr Leu Leu Thr Pro Pro Asn Ser Lys Glu Lys Glu Asp Phe Pro
305 310 315 320

Arg Leu Val Gln His Val Phe Thr Ile Lys Asp Lys Thr Asp Leu Val
325 330 335

Ile Ser Gly Leu Gly Trp Ile Arg Val Thr Gly Thr Ala Lys Val Ala
340 345 350

Val Trp Ala Pro Glu Gly Val Ala Val Val Thr Arg Lys Ala Ile Ile
355 360 365



<210> 119 
<211> 486 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 119 

Met Tyr Pro Asp Asp Ser Leu Thr Leu His Thr Asp Leu Tyr Gln Ile
1 5 10 15

Asn Met Met Gln Val Tyr Phe Asp Gln Gly Ile His Asn Lys Lys Ala
20 25 30

Val Phe Glu Val Tyr Phe Arg Gln Gln Pro Phe Lys Asn Gly Tyr Ala
35 40 45

Val Phe Ala Gly Leu Glu Arg Ile Val Asn Tyr Leu Glu Asp Leu Arg
50 55 60

Phe Ser Asp Ser Asp Ile Ala Tyr Leu Glu Ser Leu Gly Tyr His Gly
65 70 75 80

Ala Phe Leu Asp Tyr Leu Arg Asn Phe Lys Leu Glu Leu Thr Val Arg
85 90 95

Ser Ala Gln Glu Gly Asp Leu Val Phe Ala Asn Glu Pro Ile Val Gln
100 105 110

Val Glu Gly Pro Leu Ala Gln Cys Gln Leu Val Glu Thr Ala Leu Leu
115 120 125

Asn Ile Val Asn Tyr Gln Thr Leu Val Ala Thr Lys Ala Ala Arg Ile
130 135 140

Arg Ser Val Ile Glu Asp Glu Pro Leu Met Glu Phe Gly Thr Arg Arg
145 150 155 160

Ala Gln Glu Thr Asp Ala Ala Ile Trp Gly Thr Arg Ala Ala Val Ile
165 170 175

Gly Gly Ala Asn Gly Thr Ser Asn Val Arg Ala Gly Lys Leu Phe Asp
180 185 190

Ile Pro Val Leu Gly Thr His Ala His Ala Leu Val Gln Val Tyr Gly
195 200 205

Asn Asp Tyr Glu Ala Phe Lys Ala Tyr Ala Ala Thr His Lys Asn Cys
210 215 220

Val Phe Leu Val Asp Thr Tyr Asp Thr Leu Arg Ile Gly Val Pro Ala
225 230 235 240

Ala Ile Gln Val Ala Arg Glu Leu Gly Asp Gln Ile Asn Phe Met Gly
245 250 255

Val Arg Ile Asp Ser Gly Asp Ile Ala Tyr Ile Ser Lys Lys Val Arg
260 265 270

Gln Gln Leu Asp Glu Ala Gly Phe Thr Glu Ala Lys Ile Tyr Ala Ser
275 280 285

Asn Asp Leu Asp Glu Asn Thr Ile Leu Asn Leu Lys Met Gln Lys Ala
290 295 300

Lys Ile Asp Val Trp Gly Val Gly Thr Gln Leu Ile Thr Ala Tyr Asp
305 310 315 320

Gln Pro Ala Leu Gly Ala Val Tyr Lys Ile Val Ala Ile Glu Asp Glu
325 330 335

Thr Gly Gln Met Arg Asn Thr Ile Lys Leu Ser Asn Asn Ala Glu Lys
340 345 350

Val Ser Thr Pro Gly Lys Lys Gln Val Trp Arg Ile Thr Ser Arg Glu
355 360 365

Lys Gly Lys Ser Glu Gly Asp Tyr Ile Thr Tyr Asp Gly Val Asp Ile
370 375 380

Ser Asp Met Thr Glu Ile Lys Met Phe His Pro Thr Tyr Thr Tyr Ile
385 390 395 400

Lys Lys Thr Val Arg Asn Phe Asp Ala Val Pro Leu Leu Val Asp Ile
405 410 415

Phe Lys Glu Gly Ile Leu Val Tyr Asn Leu Pro Ser Leu Thr Asp Ile
420 425 430

Gln Asp Tyr Ala Arg Lys Glu Phe Asp Lys Leu Trp Asp Glu Tyr Lys
435 440 445

Arg Val Leu Asn Pro Gln His Tyr Pro Val Asp Leu Ala Arg Asp Val
450 455 460

Trp Gln Asp Lys Met Asp Leu Ile Asp Lys Met Arg Lys Glu Ala Leu
465 470 475 480

Gly Glu Gly Glu Glu Glu
485



<210> 120 
<211> 283 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 120 

Met Ala Thr Ile Gln Trp Phe Pro Gly His Met Ser Lys Ala Arg Arg
1 5 10 15

Gln Val Gln Glu Asn Leu Lys Phe Val Asp Phe Val Thr Ile Leu Val
20 25 30

Asp Ala Arg Leu Pro Leu Ser Ser Gln Asn Pro Met Leu Thr Lys Ile
35 40 45

Val Gly Asp Lys Pro Lys Leu Leu Ile Leu Asn Lys Ala Asp Leu Ala
50 55 60

Asp Pro Ala Met Thr Lys Glu Trp Arg Gln Tyr Phe Glu Ser Gln Gly
65 70 75 80

Ile Gln Thr Leu Ala Ile Asn Ser Lys Glu Gln Val Thr Val Lys Val
85 90 95

Val Thr Asp Ala Ala Lys Lys Leu Met Ala Asp Lys Ile Ala Arg Gln
100 105 110

Lys Glu Arg Gly Ile Gln Ile Glu Thr Leu Arg Thr Met Ile Ile Gly
115 120 125

Ile Pro Asn Ala Gly Lys Ser Thr Leu Met Asn Arg Leu Ala Gly Lys
130 135 140

Lys Ile Ala Val Val Gly Asn Lys Pro Gly Val Thr Lys Gly Gln Gln
145 150 155 160






WO 01/49721 



PCT/US00/35604



Trp Leu Lys Thr Asn Lys Asp Leu Glu Ile Leu Asp Thr Pro Gly Ile
165 170 175

Leu Trp Pro Lys Phe Glu Asp Glu Thr Val Ala Leu Lys Leu Ala Leu
180 185 190

Thr Gly Ala Ile Lys Asp Gln Leu Leu Pro Met Asp Glu Val Thr Ile
195 200 205

Phe Gly Ile Asn Tyr Phe Lys Glu His Tyr Pro Glu Lys Leu Ala Glu
210 215 220

Arg Phe Lys Gln Met Lys Ile Glu Glu Glu Pro Ser Val Ile Ile Met
225 230 235 240

Asp Met Thr Arg Ala Leu Gly Phe Arg Asp Asp Tyr Asp Arg Phe Tyr
245 250 255

Ser Leu Phe Val Lys Glu Val Arg Asp Gly Lys Leu Gly Asn Tyr Thr
260 265 270

Leu Asp Thr Leu Glu Asp Leu Asp Gly Asn Asp
275 280



<210> 121 
<211> 156 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 121 

Met Ile Asn Asn Val Val Leu Val Gly Arg Met Thr Arg Asp Ala Glu
1 5 10 15

Leu Arg Tyr Thr Pro Ser Asn Val Ala Val Ala Thr Phe Thr Leu Ala
20 25 30

Val Asn Arg Thr Phe Lys Ser Gln Asn Gly Glu Arg Glu Ala Asp Phe
35 40 45

Ile Asn Val Val Met Trp Arg Gln Gln Ala Glu Asn Leu Ala Asn Trp
50 55 60

Ala Lys Lys Gly Ser Leu Ile Gly Val Thr Gly Arg Ile Gln Thr Arg
65 70 75 80

Ser Tyr Asp Asn Gln Gln Gly Gln Arg Val Tyr Val Thr Glu Val Val
85 90 95

Ala Glu Asn Phe Gln Met Leu Glu Ser Arg Ser Val Arg Glu Gly His
100 105 110

Thr Gly Gly Ala Tyr Ser Ala Pro Thr Ala Asn Tyr Ser Ala Pro Thr
115 120 125

Asn Ser Val Pro Asp Phe Ser Arg Asn Glu Asn Pro Phe Gly Ala Thr
130 135 140

Asn Pro Leu Asp Ile Ser Asp Asp Asp Leu Pro Phe
145 150 155



<210> 122 
<211> 324 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 122 

Met Lys Thr Arg Ile Thr Glu Leu Leu Lys Ile Asp Tyr Pro Ile Phe
1 5 10 15

Gln Gly Gly Met Ala Trp Val Ala Asp Gly Asp Leu Ala Gly Ala Val
20 25 30

Ser Lys Ala Gly Gly Leu Gly Ile Ile Gly Gly Gly Asn Ala Pro Lys
35 40 45

Glu Val Val Lys Ala Asn Ile Asp Lys Ile Lys Ser Leu Thr Asp Lys
50 55 60

Pro Phe Gly Val Asn Ile Met Leu Leu Ser Pro Phe Val Glu Asp Ile
65 70 75 80

Val Asp Leu Val Ile Glu Glu Gly Val Lys Val Val Thr Thr Gly Ala
85 90 95

Gly Asn Pro Ser Lys Tyr Met Glu Arg Phe His Glu Ala Gly Ile Ile
100 105 110

Val Ile Pro Val Val Pro Ser Val Ala Leu Ala Lys Arg Met Glu Lys
115 120 125

Ile Gly Ala Asp Ala Val Ile Ala Glu Gly Met Glu Ala Gly Gly His
130 135 140

Ile Gly Lys Leu Thr Thr Met Thr Leu Val Arg Gln Val Ala Thr Ala
145 150 155 160

Ile Ser Ile Pro Val Ile Ala Ala Gly Gly Ile Ala Asp Gly Glu Gly
165 170 175

Ala Ala Ala Gly Phe Met Leu Gly Ala Glu Ala Val Gln Val Gly Thr
180 185 190

Arg Phe Val Val Ala Lys Glu Ser Asn Ala His Pro Asn Tyr Lys Glu
195 200 205

Lys Ile Leu Lys Ala Arg Asp Ile Asp Thr Thr Ile Ser Ala Gln His
210 215 220

Phe Gly His Ala Val Arg Ala Ile Lys Asn Gln Leu Thr Arg Asp Phe
225 230 235 240

Glu Leu Ala Glu Lys Asp Ala Phe Lys Gln Glu Asp Pro Asp Leu Glu
245 250 255

Ile Phe Glu Gln Met Gly Ala Gly Ala Leu Ala Lys Ala Val Val His
260 265 270

Gly Asp Val Glu Gly Gly Ser Val Met Ala Gly Gln Ile Ala Gly Leu
275 280 285

Val Ser Lys Glu Glu Thr Ala Glu Glu Ile Leu Lys Asp Leu Tyr Tyr
290 295 300

Gly Ala Ala Lys Lys Ile Gln Glu Glu Ala Ser Arg Trp Thr Gly Val
305 310 315 320

Val Arg Asn Asp



<210> 123 
<211> 140
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 123 

Met Ile Asp Ile Gln Gly Ile Lys Glu Ala Leu Pro His Arg Tyr Pro
1 5 10 15

Met Leu Leu Val Asp Arg Val Leu Glu Val Ser Glu Asp Thr Ile Val
20 25 30

Ala Ile Lys Asn Val Thr Ile Asn Glu Pro Phe Phe Asn Gly His Phe
35 40 45

Pro Gln Tyr Pro Val Met Pro Gly Val Leu Ile Met Glu Ala Leu Ala
50 55 60

Gln Thr Ala Gly Val Leu Glu Leu Ser Lys Pro Glu Asn Lys Gly Lys
65 70 75 80

Leu Val Phe Tyr Ala Gly Met Asp Lys Val Lys Phe Lys Lys Gln Val
85 90 95

Val Pro Gly Asp Gln Leu Val Met Thr Ala Thr Phe Val Lys Arg Arg
100 105 110

Gly Thr Ile Ala Val Val Glu Ala Lys Ala Glu Val Asp Gly Lys Leu
115 120 125

Ala Ala Ser Gly Thr Leu Thr Phe Ala Ile Gly Asn
130 135 140



<210> 124 
<211> 340 
<212> PRT

<213> Streptococcus pneumoniae
<400> 124

Met Ile Asn Gln Ile Tyr Gln Leu Thr Lys Pro Lys Phe Ile Asn Val
1 5 10 15

Lys Tyr Gln Glu Glu Ala Ile Asp Gln Glu Asn His Ile Leu Ile Arg
20 25 30

Pro Asn Tyr Met Ala Val Cys His Ala Asp Gln Arg Tyr Tyr Gln Gly
35 40 45

Lys Arg Asp Pro Lys Ile Leu Asn Lys Lys Leu Pro Met Ala Met Ile
50 55 60

His Glu Ser Cys Gly Ile Val Ile Ser Asp Pro Ser Gly Thr Tyr Glu
65 70 75 80

Val Gly Gln Lys Val Val Met Ile Pro Asn Gln Ser Pro Met Gln Ser
85 90 95

Asp Glu Glu Phe Tyr Glu Asn Tyr Met Thr Gly Thr His Phe Leu Ser
100 105 110

Ser Gly Phe Asp Gly Phe Met Arg Glu Phe Val Ser Leu Pro Lys Asp
115 120 125

Arg Val Val Ala Tyr Asp Ala Ile Glu Asp Thr Val Ala Ala Ile Thr
130 135 140

Glu Phe Val Ser Val Gly Met His Ala Met Asn Arg Leu Leu Thr Leu
145 150 155 160

Ala His Ser Lys Arg Glu Arg Ile Pro Val Ile Gly Asp Gly Ser Leu
165 170 175

Ala Phe Val Val Ala Asn Ile Ile Asn Tyr Thr Leu Pro Glu Ala Glu
180 185 190

Ile Val Val Ile Gly Arg His Trp Glu Lys Leu Glu Leu Phe Ser Phe
195 200 205

Ala Lys Glu Cys Tyr Ile Thr Asp Asn Ile Pro Glu Glu Leu Ala Phe
210 215 220

Asp His Ala Phe Glu Cys Cys Gly Gly Asp Gly Thr Gly Pro Ala Ile
225 230 235 240

Asn Asp Leu Ile Arg Tyr Ile Arg Pro Gln Gly Thr Ile Leu Met Met
245 250 255

Gly Val Ser Glu Tyr Lys Val Asn Leu Asn Thr Arg Asp Ala Leu Glu
260 265 270

Lys Gly Leu Leu Leu Val Gly Ser Ser Arg Ser Gly Arg Ile Asp Phe
275 280 285

Glu Asn Ala Ile Gln Met Met Lys Val Lys Lys Phe Ala Asn Arg Leu
290 295 300

Lys Asn Ile Leu Tyr Leu Glu Glu Pro Val Arg Glu Ile Lys Asp Ile
305 310 315 320

His Arg Val Phe Ala Thr Asp Leu Asn Thr Ala Phe Lys Thr Val Phe 
325 330 335 

Lys Trp Glu Val 
340 



<210> 125 
<211> 447 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 125 

Met Asn Leu Lys Thr Thr Leu Gly Leu Leu Ala Gly Arg Ser Ser His 
1 5 10 15

Phe Val Leu Ser Arg Leu Gly Arg Gly Ser Thr Leu Pro Gly Lys Val
20 25 30

Ala Leu Gln Phe Asp Lys Asp Ile Leu Gln Asn Leu Ala Lys Asn Tyr
35 40 45

Glu Ile Val Val Val Thr Gly Thr Asn Gly Lys Thr Leu Thr Thr Ala
50 55 60

Leu Thr Val Gly Ile Leu Lys Glu Val Tyr Gly Gln Val Leu Thr Asn
65 70 75 80

Pro Ser Gly Ala Asn Met Ile Thr Gly Ile Ala Thr Thr Phe Leu Thr
85 90 95

Ala Lys Ser Ser Lys Thr Gly Lys Asn Ile Ala Val Leu Glu Ile Asp
100 105 110

Glu Ala Ser Leu Ser Arg Ile Cys Asp Tyr Ile Gln Pro Ser Leu Phe
115 120 125

Val Ile Thr Asn Ile Phe Arg Asp Gln Met Asp Arg Phe Gly Glu Ile
130 135 140

Tyr Thr Thr Tyr Asn Met Ile Leu Asp Ala Ile Arg Lys Val Pro Thr
145 150 155 160

Ala Thr Val Leu Leu Asn Gly Asp Ser Pro Leu Phe Tyr Lys Pro Thr
165 170 175

Ile Pro Asn Pro Ile Glu Tyr Phe Gly Phe Asp Leu Glu Lys Gly Pro
180 185 190

Ala Gln Leu Ala His Tyr Asn Thr Glu Gly Ile Leu Cys Pro Asp Cys
195 200 205

Gln Gly Ile Leu Lys Tyr Glu His Asn Thr Tyr Ala Asn Leu Gly Ala
210 215 220

Tyr Ile Cys Glu Gly Cys Gly Cys Lys Arg Pro Asp Leu Asp Tyr Arg
225 230 235 240

Leu Thr Lys Leu Val Glu Leu Thr Asn Asn Arg Ser Arg Phe Val Ile
245 250 255

Asp Gly Gln Glu Tyr Gly Ile Gln Ile Gly Gly Leu Tyr Asn Ile Tyr
260 265 270

Asn Ala Leu Ala Ala Val Ala Ile Ala Arg Phe Leu Gly Ala Asp Ser
275 280 285

Gln Leu Ile Lys Gln Gly Phe Asp Lys Ser Arg Ala Val Phe Gly Arg
290 295 300

Gln Glu Thr Phe His Ile Gly Asp Lys Glu Cys Thr Leu Val Leu Ile
305 310 315 320

Lys Asn Pro Val Gly Ala Thr Gln Ala Ile Glu Met Ile Lys Leu Ala
325 330 335

Pro Tyr Pro Phe Ser Leu Ser Val Leu Leu Asn Ala Asn Tyr Ala Asp
340 345 350

Gly Ile Asp Thr Ser Trp Ile Trp Asp Ala Asp Phe Glu Gln Ile Thr
355 360 365

Asp Met Asp Ile Pro Glu Ile Asn Ala Gly Gly Val Arg His Ser Glu
370 375 380

Ile Ala Arg Arg Leu Arg Val Thr Gly Tyr Pro Ala Glu Lys Ile Thr
385 390 395 400

Glu Thr Ser Asn Leu Glu Gln Val Leu Lys Thr Ile Glu Asn Gln Asp
405 410 415

Cys Lys His Ala Tyr Ile Leu Ala Thr Tyr Thr Ala Met Leu Glu Phe
420 425 430

Arg Glu Leu Leu Ala Ser Arg Gln Ile Val Arg Lys Glu Met Asn
435 440 445 



<210> 126 
<211> 260 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 126 

Met Val Tyr Thr Ser Leu Ser Ser Lys Asp Gly Asn Tyr Pro Tyr Gln
1 5 10 15

Leu Asn Ile Ala His Leu Tyr Gly Asn Leu Met Asn Thr Tyr Gly Asp
20 25 30 






Asn Gly Asn Ile Leu Met Leu Lys Tyr Val Ala Glu Lys Leu Gly Ala
35 40 45

His Val Thr Val Asp Ile Val Ser Leu His Asp Asp Phe Asp Glu Asn
50 55 60

His Tyr Asp Ile Ala Phe Phe Gly Gly Gly Gln Asp Phe Glu Gln Ser
65 70 75 80

Ile Ile Ala Asp Asp Leu Pro Ala Lys Lys Glu Ser Ile Asp Asn Tyr
85 90 95

Ile Gln Asn Asp Gly Val Val Leu Ala Ile Cys Gly Gly Phe Gln Leu
100 105 110

Leu Gly Gln Tyr Tyr Val Glu Ala Ser Gly Lys Arg Ile Glu Gly Leu
115 120 125

Gly Val Met Gly His Tyr Thr Leu Asn Gln Thr Asn Asn Arg Phe Ile
130 135 140

Gly Asp Ile Lys Ile His Asn Glu Asp Phe Asp Glu Thr Tyr Tyr Gly
145 150 155 160

Phe Glu Asn His Gln Gly Arg Thr Phe Leu Ser Asp Asp Gln Lys Pro
165 170 175

Leu Gly Gln Val Val Tyr Gly Asn Gly Asn Asn Glu Glu Lys Val Gly
180 185 190

Glu Gly Val His Tyr Lys Asn Val Phe Gly Ser Tyr Phe His Gly Pro
195 200 205

Ile Leu Ser Arg Asn Ala Asn Leu Ala Tyr Arg Leu Val Thr Thr Ala
210 215 220

Leu Lys Lys Lys Tyr Gly Gln Asp Ile Gln Leu Pro Ala Tyr Glu Asp
225 230 235 240

Ile Leu Ser Gln Glu Ile Ala Glu Glu Tyr Ser Asp Val Lys Ser Lys
245 250 255

Ala Asp Phe Ser
260

<210> 127
<211> 223
<212> PRT

<213> Streptococcus pneumoniae

<400> 127 

Met Asn Val Lys Glu Asn Thr Glu Leu Val Phe Arg Glu Val Ala Glu 
1 5 10 15

Ala Ser Leu Ser Ala Asn Arg Glu Ser Gly Ser Val Ser Val Ile Ala
20 25 30

Val Thr Lys Tyr Val Asp Val Pro Thr Ala Glu Ala Leu Leu Pro Leu
35 40 45

Gly Val His His Ile Gly Glu Asn Arg Val Asp Lys Phe Leu Glu Lys
50 55 60

Tyr Glu Ala Leu Lys Asp Arg Asp Val Thr Trp His Leu Ile Gly Thr
65 70 75 80

Leu Gln Arg Arg Lys Val Lys Asp Val Ile Gln Tyr Val Asp Tyr Phe
85 90 95

His Ala Leu Asp Ser Val Lys Leu Ala Gly Glu Ile Gln Lys Arg Ser
100 105 110

Asp Arg Val Ile Lys Cys Phe Leu Gln Val Asn Ile Ser Lys Glu Glu
115 120 125

Ser Lys His Gly Phe Ser Arg Glu Glu Leu Leu Glu Ile Leu Pro Glu
130 135 140

Leu Ala Gly Leu Asp Lys Ile Glu Tyr Val Gly Leu Met Thr Met Ala
145 150 155 160

Pro Phe Glu Ala Ser Ser Glu Gln Leu Lys Glu Ile Phe Lys Ala Ala
165 170 175

Gln Asp Leu Gln Arg Glu Ile Gln Glu Lys Gln Ile Pro Asn Ile Pro
180 185 190

Met Thr Glu Leu Ser Met Gly Met Ser Arg Asp Tyr Lys Glu Ala Ile
195 200 205

Gln Phe Gly Ser Thr Phe Val Arg Ile Gly Thr Ser Phe Phe Lys
210 215 220



<210> 128 
<211> 279 
<212> PRT 

<213> Streptococcus pneumoniae
<400> 128

Met Gly Ile Ala Leu Glu Asn Val Asn Phe Thr Tyr Gln Glu Gly Thr
1 5 10 15

Pro Leu Ala Ser Ala Ala Leu Ser Asp Val Ser Leu Thr Ile Glu Asp
20 25 30

Gly Ser Tyr Thr Ala Leu Ile Gly His Thr Gly Ser Gly Lys Ser Thr
35 40 45

Ile Leu Gln Leu Leu Asn Gly Leu Leu Val Pro Ser Gln Gly Ser Val
50 55 60

Arg Val Phe Asp Thr Leu Ile Thr Ser Thr Ser Lys Asn Lys Asp Ile
65 70 75 80

Arg Gln Ile Arg Lys Gln Val Gly Leu Val Phe Gln Phe Ala Glu Asn
85 90 95

Gln Ile Phe Glu Glu Thr Val Leu Lys Asp Val Ala Phe Gly Pro Gln
100 105 110

Asn Phe Gly Val Ser Glu Glu Asp Ala Val Lys Thr Ala Arg Glu Lys
115 120 125

Leu Ala Leu Val Gly Ile Asp Glu Ser Leu Phe Asp Arg Ser Pro Phe
130 135 140

Glu Leu Ser Gly Gly Gln Met Arg Arg Val Ala Ile Ala Gly Ile Leu
145 150 155 160

Ala Met Glu Pro Ala Ile Leu Val Leu Asp Glu Pro Thr Ala Gly Leu
165 170 175

Asp Pro Leu Gly Arg Lys Glu Leu Met Thr Leu Phe Lys Lys Leu His
180 185 190

Gln Ser Gly Met Thr Ile Val Leu Val Thr His Leu Met Asp Asp Val
195 200 205

Ala Glu Tyr Ala Asn Gln Val Tyr Val Met Glu Lys Gly Arg Leu Val
210 215 220

Lys Gly Gly Lys Pro Ser Asp Val Phe Gln Asp Val Val Phe Met Glu
225 230 235 240

Glu Val Gln Leu Gly Val Pro Lys Ile Thr Ala Phe Cys Lys Arg Leu
245 250 255

Ala Asp Arg Gly Val Ser Phe Lys Arg Leu Pro Val Lys Ile Glu Glu
260 265 270

Phe Lys Glu Ser Leu Asn Gly
275



<210> 129 
<211> 309 
<212> PRT

<213> Streptococcus pneumoniae 

<400> 129 

Met Asp Ile Gln Phe Leu Gly Thr Gly Ala Gly Gln Pro Ser Lys Ala
1 5 10 15

Arg Asn Val Ser Ser Leu Ala Leu Lys Leu Leu Asp Glu Ile Asn Glu
20 25 30

Val Trp Leu Phe Asp Cys Gly Glu Gly Thr Gln Asn Arg Ile Leu Glu
35 40 45

Thr Thr Ile Arg Pro Arg Lys Val Ser Lys Ile Phe Ile Thr His Leu
50 55 60

His Gly Asp His Ile Phe Gly Leu Pro Gly Phe Leu Ser Ser Arg Ala
65 70 75 80

Phe Gln Ala Asn Glu Glu Gln Thr Asp Leu Glu Ile Tyr Gly Pro Gln
85 90 95

Gly Ile Lys Ser Phe Val Leu Thr Ser Leu Arg Val Ser Gly Ser Arg
100 105 110

Leu Pro Tyr Arg Ile His Phe His Glu Phe Asp Gln Asp Ser Leu Gly
115 120 125

Lys Ile Leu Glu Ile Asp Lys Phe Thr Val Tyr Ala Glu Glu Leu Asp
130 135 140

His Thr Ile Phe Cys Val Gly Tyr Arg Val Met Gln Lys Asp Leu Glu
145 150 155 160

Gly Thr Leu Asp Ala Glu Lys Leu Lys Ala Ala Gly Val Pro Phe Gly
165 170 175

Pro Leu Phe Gly Lys Ile Lys Asn Gly Gln Asp Leu Val Leu Glu Asp
180 185 190

Gly Thr Glu Ile Lys Ala Ala Asp Tyr Ile Ser Ala Pro Arg Pro Gly
195 200 205

Lys Ile Ile Thr Ile Leu Gly Asp Thr Arg Lys Thr Asp Ala Ser Val
210 215 220

Arg Leu Ala Val Asn Ala Asp Val Leu Val His Glu Ser Thr Tyr Gly
225 230 235 240

Lys Gly Asp Glu Lys Ile Ala Arg Asn His Gly His Ser Thr Asn Met
245 250 255

Gln Ala Ala Gln Val Ala Val Glu Ala Gly Ala Lys Arg Leu Leu Leu
260 265 270

Asn His Ile Ser Ala Arg Phe Leu Ser Lys Asp Ile Ser Lys Leu Lys
275 280 285

Lys Asp Ala Ala Thr Ile Phe Glu Asn Val His Val Val Lys Asp Leu
290 295 300

Glu Glu Val Glu Ile
305



<210> 130 
<211> 553 
<212> PRT

<213> Streptococcus pneumoniae
<400> 130

Met Ser Asn Ile Ser Leu Thr Thr Leu Gly Gly Val Arg Glu Asn Gly
1 5 10 15

Lys Asn Met Tyr Ile Ala Glu Ile Gly Glu Ser Ile Phe Val Leu Asn
20 25 30

Val Gly Leu Lys Tyr Pro Glu Asn Glu Gln Leu Gly Val Asp Val Val
35 40 45

Ile Pro Asn Met Asp Tyr Leu Phe Glu Asn Ser Asp Arg Ile Ala Gly
50 55 60

Val Phe Leu Thr His Gly His Ala Asp Ala Ile Gly Ala Leu Pro Tyr
65 70 75 80

Leu Leu Ala Glu Ala Lys Val Pro Val Phe Gly Ser Glu Leu Thr Ile
85 90 95

Glu Leu Ala Lys Leu Phe Val Lys Gly Asn Asp Ala Val Lys Lys Phe
100 105 110

Asn Asp Phe His Val Ile Asp Glu Asn Thr Glu Ile Asp Phe Gly Gly
115 120 125

Thr Val Val Ser Phe Phe Pro Thr Thr Tyr Ser Val Pro Glu Ser Leu
130 135 140

Gly Ile Val Leu Lys Thr Ser Glu Gly Ser Ile Val Tyr Thr Gly Asp
145 150 155 160

Phe Lys Phe Asp Gln Thr Ala Ser Glu Ser Tyr Ala Thr Asp Phe Ala
165 170 175

Arg Leu Ala Glu Ile Gly Arg Asp Gly Val Leu Ala Leu Leu Ser Asp
180 185 190

Ser Ala Asn Ala Asp Ser Asn Ile Gln Val Ala Ser Glu Ser Glu Val
195 200 205

Arg Asp Glu Ile Thr Gln Thr Ile Ala Asp Trp Glu Gly Arg Ile Ile
210 215 220

Val Ala Ala Val Ser Ser Asn Leu Ser Arg Ile Gln Gln Ile Phe Asp
225 230 235 240

Ala Ala Asp Lys Thr Gly Arg Arg Ile Val Leu Thr Gly Phe Asp Ile
245 250 255

Glu Asn Ile Val Arg Thr Ala Ile Arg Leu Lys Lys Leu Ser Leu Ala
260 265 270

Asn Glu Ile Leu Leu Ile Lys Pro Lys Asp Met Ser Arg Phe Glu Asp
275 280 285

His Glu Leu Ile Ile Leu Glu Thr Gly Arg Met Gly Glu Pro Ile Asn
290 295 300

Gly Leu Arg Lys Met Ser Ile Gly Arg His Arg Tyr Val Glu Ile Lys
305 310 315 320

Asp Gly Asp Leu Val Tyr Ile Ala Thr Ala Pro Ser Ile Ala Lys Glu
325 330 335

Ala Phe Val Ala Arg Val Glu Asn Met Ile Tyr Gln Ala Gly Gly Val
340 345 350

Val Lys Leu Ile Thr Gln Ser Leu His Val Ser Gly His Gly Asn Val
355 360 365

Arg Asp Leu Gln Leu Met Ile Asn Leu Leu Gln Pro Lys Tyr Leu Phe
370 375 380

Pro Val Gln Gly Glu Tyr Arg Glu Leu Asp Ala His Ala Lys Ala Ala
385 390 395 400

Met Ala Val Gly Met Leu Pro Glu Arg Ile Phe Ile Pro Lys Lys Gly
405 410 415

Thr Thr Met Ala Tyr Glu Asn Gly Asp Phe Val Pro Ala Gly Ser Val
420 425 430

Ser Ala Gly Asp Ile Leu Ile Asp Gly Asn Ala Ile Gly Asp Val Gly
435 440 445

Asn Val Val Leu Arg Asp Arg Lys Val Leu Ser Glu Asp Gly Ile Phe
450 455 460

Ile Val Ala Ile Thr Val Asn Arg Arg Glu Lys Lys Ile Val Ala Arg
465 470 475 480

Ala Arg Val His Thr Arg Gly Phe Val Tyr Leu Lys Lys Ser Arg Asp
485 490 495

Ile Leu Arg Glu Ser Ser Glu Leu Ile Asn Gln Thr Val Glu Asp Tyr
500 505 510

Leu Gln Gly Asp Asp Phe Asp Trp Ala Asp Leu Lys Gly Lys Val Arg
515 520 525

Asp Asn Leu Thr Lys Tyr Leu Phe Asp Gln Thr Lys Arg Arg Pro Ala
530 535 540

Ile Leu Pro Val Val Met Glu Ala Lys
545 550

<210> 131 
<211> 316 
<212> PRT
<213> Streptococcus pneumoniae

<400> 131 

Met Thr Lys Glu Phe His His Val Thr Val Leu Leu His Glu Thr Ile
1 5 10 15

Asp Met Leu Asp Val Lys Pro Asp Gly Ile Tyr Val Asp Ala Thr Leu
20 25 30

Gly Gly Ala Gly His Ser Glu Tyr Leu Leu Ser Lys Leu Ser Glu Lys
35 40 45

Gly His Leu Tyr Ala Phe Asp Gln Asp Gln Asn Ala Ile Asp Asn Ala
50 55 60

Gln Lys Arg Leu Ala Pro Tyr Ile Glu Lys Gly Val Val Thr Phe Ile
65 70 75 80

Lys Asp Asn Phe Arg His Leu Gln Ala Arg Leu Arg Glu Ala Gly Val
85 90 95

Gln Glu Ile Asp Gly Ile Cys Tyr Asp Leu Gly Val Ser Ser Pro Gln
100 105 110

Leu Asp Gln Arg Glu Arg Gly Phe Ser Tyr Lys Lys Asp Ala Pro Leu
115 120 125

Asp Met Arg Met Asn Gln Asp Ala Ser Leu Thr Ala Tyr Glu Val Val
130 135 140

Asn His Tyr Asp Tyr His Asp Leu Val Arg Ile Phe Phe Lys Tyr Gly
145 150 155 160

Glu Asp Lys Phe Ser Lys Gln Ile Ala Arg Lys Ile Glu Gln Ala Arg
165 170 175

Glu Val Lys Pro Ile Glu Thr Thr Thr Glu Leu Ala Glu Ile Ile Lys
180 185 190

Leu Val Lys Pro Ala Lys Glu Leu Lys Lys Lys Gly His Pro Ala Lys
195 200 205

Gln Ile Phe Gln Ala Ile Arg Ile Glu Val Asn Asp Glu Leu Gly Ala
210 215 220

Ala Asp Glu Ser Ile Gln Gln Ala Met Asp Met Leu Ala Leu Asp Gly
225 230 235 240

Arg Ile Ser Val Ile Thr Phe His Ser Leu Glu Asp Arg Leu Thr Lys
245 250 255

Gln Leu Phe Lys Glu Ala Ser Thr Val Glu Val Pro Lys Gly Leu Pro
260 265 270

Phe Ile Pro Asp Asp Leu Lys Pro Lys Met Glu Leu Val Ser Arg Lys
275 280 285

Pro Ile Leu Pro Ser Ala Glu Glu Leu Glu Ala Asn Asn Arg Ser His
290 295 300

Ser Ala Lys Leu Arg Val Val Arg Lys Ile His Lys
305 310 315



<210> 132 
<211> 332 
<212> PRT 

<213> Streptococcus pneumoniae
<400> 132 

Met Ser Arg Ile Leu Asp Asn Glu Ile Met Gly Asp Glu Glu Leu Val
1 5 10 15

Glu Arg Thr Leu Arg Pro Gln Tyr Leu Arg Glu Tyr Ile Gly Gln Asp
20 25 30

Lys Val Lys Asp Gln Leu Gln Ile Phe Ile Glu Ala Ala Lys Met Arg
35 40 45

Asp Glu Ala Leu Asp His Val Leu Leu Phe Gly Pro Pro Gly Leu Gly
50 55 60

Lys Thr Thr Met Ala Phe Val Ile Ala Asn Glu Leu Gly Val Asn Leu
65 70 75 80

Lys Gln Thr Ser Gly Pro Val Ile Glu Lys Ala Gly Asp Leu Val Ala
85 90 95

Ile Leu Asn Glu Leu Glu Pro Gly Asp Val Leu Phe Ile Asp Glu Ile
100 105 110

His Arg Leu Pro Met Ser Val Glu Glu Val Leu Tyr Ser Ala Met Glu
115 120 125

Asp Phe Tyr Ile Asp Ile Met Ile Gly Ala Gly Glu Gly Ser Arg Ser
130 135 140

Val His Leu Glu Leu Pro Pro Phe Thr Leu Ile Gly Ala Thr Thr Arg
145 150 155 160

Ala Gly Met Leu Ser Asn Pro Leu Arg Ala Arg Phe Gly Ile Thr Gly
165 170 175

His Met Glu Tyr Tyr Ala His Ala Asp Leu Thr Glu Ile Val Glu Arg
180 185 190

Thr Ala Asp Ile Phe Glu Met Glu Ile Thr His Glu Ala Ala Ser Glu
195 200 205

Leu Ala Leu Arg Ser Arg Gly Thr Pro Arg Ile Ala Asn Arg Leu Leu
210 215 220

Lys Arg Val Arg Asp Phe Ala Gln Ile Met Gly Asn Gly Val Ile Asp
225 230 235 240

Asp Ile Ile Thr Asp Lys Ala Leu Thr Met Leu Asp Val Asp His Glu
245 250 255

Gly Leu Asp Tyr Val Asp Gln Lys Ile Leu Arg Thr Met Ile Glu Met
260 265 270

Tyr Ser Gly Gly Pro Val Gly Leu Gly Thr Leu Ser Val Asn Ile Ala
275 280 285

Glu Glu Arg Glu Thr Val Glu Asp Met Tyr Glu Pro Tyr Leu Ile Gln
290 295 300

Lys Gly Phe Ile Met Arg Thr Arg Ser Gly Arg Val Ala Thr Ala Lys
305 310 315 320

Ala Tyr Glu His Leu Gly Tyr Glu Tyr Ser Glu Lys
325 330



<210> 133
<211> 436
<212> PRT

<213> Streptococcus pneumoniae
<400> 133

Met Ser Met Phe Leu Asp Thr Ala Lys Ile Lys Val Lys Ala Gly Asn
1 5 10 15

Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu Lys Tyr Val Pro Asn
20 25 30

Gly Gly Pro Trp Gly Gly Asp Gly Gly Arg Gly Gly Asn Val Val Phe
35 40 45

Val Val Asp Glu Gly Leu Arg Thr Leu Met Asp Phe Arg Tyr Asn Arg
50 55 60

His Phe Lys Ala Asp Ser Gly Glu Lys Gly Met Thr Lys Gly Met His
65 70 75 80

Gly Arg Gly Ala Glu Asp Leu Arg Val Arg Val Ser Gln Gly Thr Thr
85 90 95

Val Arg Asp Ala Glu Thr Gly Lys Val Leu Thr Asp Leu Ile Lys His
100 105 110

Gly Gln Glu Phe Ile Val Ala His Gly Gly Arg Gly Gly Arg Gly Asn
115 120 125

Ile Arg Phe Ala Thr Pro Lys Asn Pro Ala Pro Glu Ile Ser Glu Asn
130 135 140

Gly Glu Pro Gly Gln Glu Arg Glu Leu Gln Leu Glu Leu Lys Ile Leu
145 150 155 160

Ala Asp Val Gly Leu Val Gly Phe Pro Ser Val Gly Lys Ser Thr Leu
165 170 175

Leu Ser Val Ile Thr Ser Ala Lys Pro Lys Ile Gly Ala Tyr His Phe
180 185 190

Thr Thr Ile Val Pro Asn Leu Gly Met Val Arg Thr Gln Ser Gly Glu
195 200 205

Ser Phe Ala Val Ala Asp Leu Pro Gly Leu Ile Glu Gly Ala Ser Gln
210 215 220

Gly Val Gly Leu Gly Thr Gln Phe Leu Arg His Ile Glu Arg Thr Arg
225 230 235 240

Val Ile Leu His Ile Ile Asp Met Ser Ala Ser Glu Gly Arg Asp Pro
245 250 255

Tyr Glu Asp Tyr Leu Ala Ile Asn Lys Glu Leu Glu Ser Tyr Asn Leu
260 265 270

Arg Leu Met Glu Arg Pro Gln Ile Ile Val Ala Asn Lys Met Asp Met
275 280 285

Pro Glu Ser Gln Glu Asn Leu Glu Glu Phe Lys Lys Lys Leu Ala Glu
290 295 300

Asn Tyr Asp Glu Phe Glu Glu Leu Pro Ala Ile Phe Pro Ile Ser Gly
305 310 315 320

Leu Thr Lys Gln Gly Leu Ala Thr Leu Leu Asp Ala Thr Ala Glu Leu
325 330 335

Leu Asp Lys Thr Pro Glu Phe Leu Leu Tyr Asp Glu Ser Asp Met Glu
340 345 350

Glu Glu Ala Tyr Tyr Gly Phe Asp Glu Glu Glu Lys Ala Phe Glu Ile
355 360 365

Ser Arg Asp Asp Asp Ala Thr Trp Val Leu Ser Gly Glu Lys Leu Met
370 375 380

Lys Leu Phe Asn Met Thr Asn Phe Asp Arg Asp Glu Ser Val Met Lys
385 390 395 400

Phe Ala Arg Gln Leu Arg Gly Met Gly Val Asp Glu Ala Leu Arg Ala
405 410 415

Arg Gly Ala Lys Asp Gly Asp Leu Val Arg Ile Gly Lys Phe Glu Phe
420 425 430 

Glu Phe Val Asp 
435 



<210> 134 
<211> 172 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 134 

Met Asn Tyr Phe Asn Val Gly Lys Ile Val Asn Thr Gln Gly Leu Gln
1 5 10 15

Gly Glu Met Arg Val Leu Ser Val Thr Asp Phe Ala Glu Glu Arg Phe
20 25 30

Lys Lys Gly Ala Glu Leu Ala Leu Phe Asp Glu Lys Asp Gln Phe Val
35 40 45

Gln Thr Val Thr Ile Ala Ser His Arg Lys Gln Lys Asn Phe Asp Ile
50 55 60

Ile Lys Phe Lys Asp Met Tyr His Ile Asn Thr Ile Glu Lys Tyr Lys
65 70 75 80

Gly Tyr Ser Leu Lys Val Ala Glu Glu Asp Leu Asn Asp Leu Asp Asp
85 90 95

Gly Glu Phe Tyr Tyr His Glu Ile Ile Gly Leu Glu Val Tyr Glu Gly
100 105 110

Asp Ser Leu Val Gly Thr Ile Lys Glu Ile Leu Gln Pro Gly Ala Asn
115 120 125

Asp Val Trp Val Val Lys Arg Lys Gly Lys Arg Asp Leu Leu Leu Pro
130 135 140

Tyr Ile Pro Pro Val Val Leu Asn Val Asp Ile Pro Asn Lys Arg Val
145 150 155 160

Asp Val Glu Ile Leu Glu Gly Leu Asp Asp Glu Asp
165 170



<210> 135 

<211> 239

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 135 

Met Lys Ile Asp Ile Leu Thr Leu Phe Pro Glu Met Phe Ser Pro Leu
1 5 10 15

Glu His Ser Ile Val Gly Lys Ala Arg Glu Lys Gly Leu Leu Asp Ile
20 25 30

Gln Tyr His Asn Phe Arg Glu Asn Ala Glu Lys Ala Arg His Val Asp
35 40 45

Asp Glu Pro Tyr Gly Gly Gly Gln Gly Met Leu Leu Arg Ala Gln Pro
50 55 60

Ile Phe Asp Ser Phe Asp Ala Ile Glu Lys Lys Asn Pro Arg Val Ile
65 70 75 80

Leu Leu Asp Pro Ala Gly Lys Gln Phe Asp Gln Ala Tyr Ala Glu Asp

85 90 95 






Leu Ala Gln Glu Glu Glu Leu Ile Phe Ile Cys Gly His Tyr Glu Gly
100 105 110

Tyr Asp Glu Arg Ile Lys Thr Leu Val Thr Asp Glu Ile Ser Leu Gly
115 120 125

Asp Tyr Val Leu Thr Gly Gly Glu Leu Ala Ala Met Thr Met Ile Asp
130 135 140

Ala Thr Val Arg Leu Ile Pro Glu Val Ile Gly Lys Glu Ser Ser His
145 150 155 160

Gln Asp Asp Ser Phe Ser Ser Gly Leu Leu Glu Tyr His Gln Tyr Thr
165 170 175

Arg Pro Tyr Asp Tyr Arg Gly Met Val Val Pro Asp Val Leu Met Ser
180 185 190

Gly His His Glu Lys Ile Arg Gln Trp Arg Leu Tyr Glu Ser Leu Lys
195 200 205

Lys Thr Tyr Glu Arg Arg Pro Asp Leu Leu Glu His Tyr Gln Leu Thr
210 215 220

Val Glu Glu Glu Lys Met Leu Ala Glu Ile Lys Glu Asn Lys Glu
225 230 235 



<210> 136 
<211> 186 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 136 

Met Ile Glu Ala Ser Lys Leu Lys Ala Gly Met Thr Phe Glu Thr Ala
1 5 10 15

Asp Gly Lys Leu Ile Arg Val Leu Glu Ala Ser His His Lys Pro Gly
20 25 30

Lys Gly Asn Thr Ile Met Arg Met Lys Leu Arg Asp Val Arg Thr Gly
35 40 45

Ser Thr Phe Asp Thr Ser Tyr Arg Pro Glu Glu Lys Phe Glu Gln Ala
50 55 60

Ile Ile Glu Thr Val Pro Ala Gln Tyr Leu Tyr Lys Met Asp Asp Thr
65 70 75 80

Ala Tyr Phe Met Asn Thr Glu Thr Tyr Asp Gln Tyr Glu Ile Pro Val
85 90 95

Val Asn Val Glu Asn Glu Leu Leu Tyr Ile Leu Glu Asn Ser Asp Val
100 105 110

Lys Ile Gln Phe Tyr Gly Thr Glu Val Ile Gly Val Thr Val Pro Thr
115 120 125

Thr Val Glu Leu Thr Val Ala Glu Thr Gln Pro Ser Ile Lys Gly Ala
130 135 140

Thr Val Thr Gly Ser Gly Lys Pro Ala Thr Met Glu Thr Gly Leu Val
145 150 155 160

Val Asn Val Pro Asp Phe Ile Glu Ala Gly Gln Lys Leu Val Ile Asn
165 170 175

Thr Ala Glu Gly Thr Tyr Val Ser Arg Ala
180 185 



<210> 137
<211> 523
<212> PRT

<213> Streptococcus pneumoniae
<400> 137

Met Ala Phe Glu Ser Leu Thr Glu Arg Leu Gln Asn Val Phe Lys Asn
1 5 10 15

Leu Arg Lys Lys Gly Lys Ile Ser Glu Ser Asp Val Gln Glu Ala Thr
20 25 30

Lys Glu Ile Arg Leu Ala Leu Leu Glu Ala Asp Val Ala Leu Pro Val
35 40 45

Val Lys Asp Phe Ile Lys Lys Val Arg Glu Arg Ala Val Gly His Glu
50 55 60

Val Ile Asp Thr Leu Asn Pro Ala Gln Gln Ile Ile Lys Ile Val Asp
65 70 75 80

Glu Glu Leu Thr Ala Val Leu Gly Ser Asp Thr Ala Glu Ile Ile Lys
85 90 95

Ser Pro Lys Ile Pro Thr Ile Ile Met Met Val Gly Leu Gln Gly Ala
100 105 110

Gly Lys Thr Thr Phe Ala Gly Lys Leu Ala Asn Lys Leu Lys Lys Glu
115 120 125

Glu Asn Ala Arg Pro Leu Met Ile Ala Ala Asp Ile Tyr Arg Pro Ala
130 135 140

Ala Ile Asp Gln Leu Lys Thr Leu Gly Gln Gln Ile Asp Val Pro Val
145 150 155 160

Phe Ala Leu Gly Thr Glu Val Pro Ala Val Glu Ile Val Arg Gln Gly
165 170 175

Leu Glu Gln Ala Gln Thr Asn His Asn Asp Tyr Val Leu Ile Asp Thr
180 185 190

Ala Gly Arg Leu Gln Ile Asp Glu Leu Leu Met Asn Glu Leu Arg Asp
195 200 205

Val Lys Thr Leu Ala Gln Pro Asn Glu Ile Leu Leu Val Val Asp Ala
210 215 220

Met Ile Gly Gln Glu Ala Ala Asn Val Ala Arg Glu Phe Asn Ala Gln
225 230 235 240

Leu Glu Val Thr Gly Val Ile Leu Thr Lys Ile Asp Gly Asp Thr Arg
245 250 255

Gly Gly Ala Ala Leu Ser Val Arg His Ile Thr Gly Lys Pro Ile Lys
260 265 270

Phe Thr Gly Thr Gly Glu Lys Ile Thr Asp Ile Glu Thr Phe His Pro
275 280 285

Asp Arg Met Ser Ser Arg Ile Leu Gly Met Gly Asp Met Leu Thr Leu
290 295 300

Ile Glu Lys Ala Ser Gln Glu Tyr Asp Glu Gln Lys Ala Leu Glu Met
305 310 315 320

Ala Glu Lys Met Arg Glu Asn Thr Phe Asp Phe Asn Asp Phe Ile Asp
325 330 335

Gln Leu Asp Gln Val Gln Asn Met Gly Pro Met Glu Asp Leu Leu Lys
340 345 350

Met Ile Pro Gly Met Ala Asn Asn Pro Ala Leu Gln Asn Met Lys Val
355 360 365

Asp Glu Arg Gln Ile Ala Arg Lys Arg Ala Ile Val Ser Ser Met Thr
370 375 380

Pro Glu Glu Arg Glu Asn Pro Asp Leu Leu Asn Pro Ser Arg Arg Arg
385 390 395 400

Arg Ile Ala Ala Gly Ser Gly Asn Thr Phe Val Glu Val Asn Lys Phe
405 410 415

Ile Lys Asp Phe Asn Gln Ala Lys Gln Leu Met Gln Gly Val Met Ser
420 425 430

Gly Asp Met Asn Lys Met Met Lys Gln Met Gly Ile Asn Pro Asn Asn
435 440 445

Leu Pro Lys Asn Met Pro Asn Met Gly Gly Met Asp Met Ser Ala Leu
450 455 460

Glu Gly Met Met Gly Gln Gly Gly Met Pro Asp Leu Ser Ala Leu Gly
465 470 475 480

Gly Ala Gly Met Pro Asp Met Ser Gln Met Phe Gly Gly Gly Leu Lys
485 490 495

Gly Lys Ile Gly Glu Phe Ala Met Lys Gln Ser Met Lys Arg Met Ala
500 505 510



Asn Lys Met Lys Lys Ala Lys Lys Lys Arg Lys 
515 520 



<210> 138 

<211> 281 

<212> PRT 

<213> Streptococcus pneumoniae

<400> 138 

Met Tyr Leu Ile Glu Ile Leu Lys Ser Ile Phe Phe Gly Ile Val Glu
1 5 10 15

Gly Ile Thr Glu Trp Leu Pro Ile Ser Ser Thr Gly His Leu Ile Leu
20 25 30

Ala Glu Glu Phe Ile Gln Tyr Gln Asn Gln Asn Glu Ala Phe Met Ser
35 40 45

Met Phe Asn Val Val Ile Gln Leu Gly Ala Ile Leu Ala Val Met Val
50 55 60

Ile Tyr Phe Asn Lys Leu Asn Pro Phe Lys Pro Thr Lys Asp Lys Gln
65 70 75 80

Glu Val Arg Lys Thr Trp Arg Leu Trp Leu Lys Val Leu Ile Ala Thr
85 90 95

Leu Pro Leu Leu Gly Val Phe Lys Phe Asp Asp Trp Phe Asp Thr His
100 105 110

Phe His Asn Met Val Ser Val Ala Leu Met Leu Ile Ile Tyr Gly Val
115 120 125

Ala Phe Ile Tyr Leu Glu Lys Arg Asn Lys Ala Arg Ala Ile Glu Pro
130 135 140

Ser Val Thr Glu Leu Asp Lys Leu Pro Tyr Thr Thr Ala Phe Tyr Ile
145 150 155 160

Gly Leu Phe Gln Val Leu Ala Leu Leu Pro Gly Thr Ser Arg Ser Gly
165 170 175

Ala Thr Ile Val Gly Gly Leu Leu Asn Gly Thr Ser Arg Ser Val Val
180 185 190

Thr Glu Phe Thr Phe Tyr Leu Gly Ile Pro Val Met Phe Gly Ala Ser
195 200 205

Ala Leu Lys Ile Phe Lys Phe Val Lys Ala Gly Glu Leu Leu Ser Phe
210 215 220

Gly Gln Leu Phe Leu Leu Leu Val Ala Met Gly Val Ala Phe Ala Val
225 230 235 240

Ser Met Val Ala Ile Arg Phe Leu Thr Ser Tyr Val Lys Lys His Asp
245 250 255

Phe Thr Leu Phe Gly Lys Tyr Arg Ile Val Leu Gly Ser Val Leu Leu
260 265 270

Leu Tyr Ser Phe Val Arg Leu Phe Val
275 280



<210> 139 
<211> 429 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 139 

Met Gly Leu Phe Asp Arg Leu Phe Gly Lys Lys Glu Glu Pro Lys Ile
1 5 10 15

Glu Glu Val Val Lys Glu Ala Leu Glu Asn Leu Asp Leu Ser Glu Asp
20 25 30

Ile Glu Pro Ala Phe Thr Glu Ala Glu Glu Val Ser Gln Glu Glu Ala
35 40 45

Glu Val Glu Ser Ser Glu Glu Ser Val Phe Gln Glu Glu Asp Ser Gln
50 55 60

Asp Thr Val Glu Glu Asn Leu Asp Leu Glu Pro Val Val Glu Val Ser
65 70 75 80

Gln Glu Glu Val Glu Glu Phe Pro Asn Ser Gln Glu Val Thr Glu Glu
85 90 95

Glu Lys Leu Glu His Glu Gly Thr Val Glu Glu Asn Asn Phe Glu Val
100 105 110

Leu Glu Pro Glu Ala Pro Gln Thr Glu Glu Thr Val Gln Glu Lys Tyr
115 120 125

Asp Arg Ser Leu Lys Lys Thr Arg Thr Gly Phe Gly Ala Arg Leu Asn
130 135 140

Ala Phe Phe Ala Asn Phe Arg Ser Val Asp Glu Glu Phe Phe Glu Glu
145 150 155 160

Leu Glu Glu Leu Leu Ile Met Ser Asp Val Gly Val Gln Val Ala Ser
165 170 175

Asn Leu Thr Glu Glu Leu Arg Tyr Glu Ala Lys Leu Glu Asn Ala Lys
180 185 190

Lys Pro Asp Ala Leu Arg Arg Val Ile Ile Glu Lys Leu Val Glu Leu
195 200 205

Tyr Glu Lys Asp Gly Ser Tyr Asp Glu Ser Ile His Phe Gln Asp Asn
210 215 220

Leu Thr Val Met Leu Phe Val Gly Val Asn Gly Val Gly Lys Thr Thr
225 230 235 240

Ser Ile Gly Lys Leu Ala His Arg Tyr Lys Arg Ala Gly Lys Lys Val
245 250 255

Met Leu Val Ala Ala Asp Thr Phe Arg Ala Gly Ala Val Ala Gln Leu
260 265 270

Ala Glu Trp Gly Arg Arg Val Asp Val Pro Val Val Thr Gly Pro Glu
275 280 285

Lys Ala Asp Pro Ala Ser Val Val Phe Asp Gly Met Glu Arg Ala Val
290 295 300

Ala Glu Gly Ile Asp Ile Leu Met Ile Asp Thr Ala Gly Arg Leu Gln
305 310 315 320

Asn Lys Asp Asn Leu Met Ala Glu Leu Glu Lys Ile Gly Arg Ile Ile
325 330 335

Lys Arg Val Val Pro Glu Ala Pro His Glu Thr Phe Leu Ala Leu Asp
340 345 350

Ala Ser Thr Gly Gln Asn Ala Leu Val Gln Ala Lys Glu Phe Ser Lys
355 360 365

Ile Thr Pro Leu Thr Gly Ile Val Leu Thr Lys Ile Asp Gly Thr Ala
370 375 380

Arg Gly Gly Val Val Leu Ala Ile Arg Glu Glu Leu Asn Ile Pro Val
385 390 395 400

Lys Leu Ile Gly Phe Gly Glu Lys Ile Asp Asp Ile Gly Glu Phe Asn
405 410 415

Ser Glu Asn Phe Met Lys Gly Leu Leu Glu Gly Leu Ile
420 425

<210> 140 
<211> 165 
<212> PRT
<213> Streptococcus pneumoniae

<400> 140 

Met Tyr Ile Glu Met Val Asp Glu Thr Gly Gln Val Ser Lys Glu Met
1 5 10 15

Leu Gln Gln Thr Gln Glu Ile Leu Glu Phe Ala Ala Lys Lys Leu Gly
20 25 30

Lys Glu Asp Lys Glu Met Ala Val Thr Phe Val Thr Asn Glu Arg Ser
35 40 45

His Glu Leu Asn Leu Glu Tyr Arg Asp Thr Asp Arg Pro Thr Asp Val
50 55 60

Ile Ser Leu Glu Tyr Lys Pro Glu Leu Glu Ile Ala Phe Asp Glu Glu
65 70 75 80

Asp Leu Leu Glu Asn Pro Glu Leu Ala Glu Met Met Ser Glu Phe Asp
85 90 95

Ala Tyr Ile Gly Glu Leu Phe Ile Ser Ile Asp Lys Ala His Glu Gln
100 105 110

Ala Glu Glu Tyr Gly His Ser Phe Glu Arg Glu Met Gly Phe Leu Ala
115 120 125

Val His Gly Phe Leu His Ile Asn Gly Tyr Asp His Tyr Thr Pro Glu
130 135 140

Glu Glu Ala Glu Met Phe Gly Leu Gln Glu Glu Ile Leu Thr Ala Tyr
145 150 155 160

Gly Leu Thr Arg Gln
165 



<210> 141 
<211> 255 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 141 

Met Ser Ile Arg Val Ile Ile Ala Gly Phe Lys Gly Lys Met Gly Gln
1 5 10 15

Ala Ala Cys Gln Met Val Leu Thr Asp Pro Asp Leu Asp Leu Val Ala
20 25 30

Val Leu Asp Pro Phe Glu Ser Glu Ser Glu Trp Gln Gly Ile Pro Val
35 40 45

Phe Lys Asp Lys Ala Asp Leu Ala Gly Phe Glu Ala Asp Val Trp Val
50 55 60

Asp Phe Thr Thr Pro Ala Val Ala Tyr Glu Asn Thr Arg Phe Ala Leu
65 70 75 80

Glu Asn Gly Phe Ala Pro Val Val Gly Thr Thr Gly Phe Thr Ser Glu
85 90 95

Glu Ile Ala Glu Leu Lys Glu Phe Ser Arg Ala Gln Asp Leu Gly Gly
100 105 110

Leu Ile Ala Pro Asn Phe Ala Leu Gly Ala Val Leu Leu Met Gln Phe
115 120 125

Ala Thr Gln Ala Ala Lys Tyr Phe Pro Asn Val Glu Ile Ile Glu Leu
130 135 140

His His Asp Lys Lys Lys Asp Ala Pro Ser Gly Thr Ala Ile Lys Thr
145 150 155 160

Ala Glu Leu Met Ala Glu Val Arg Glu Ser Ile Gln Gln Gly Ala Ala
165 170 175

Asp Glu Glu Glu Leu Ile Ala Gly Ala Arg Gly Ala Asp Phe Asp Gly
180 185 190

Met Arg Ile His Ser Val Arg Leu Pro Gly Leu Val Ala His Gln Glu
195 200 205

Val Ile Phe Gly Asn Gln Gly Glu Gly Leu Thr Leu Arg His Asp Ser
210 215 220

Tyr Asp Arg Ile Ser Phe Met Thr Gly Val Asn Leu Gly Ile Lys Glu
225 230 235 240

Val Val Lys Arg His Glu Leu Val Tyr Gly Leu Glu His Leu Leu
245 250 255



<210> 142 
<211> 91 
<212> PRT

<213> Streptococcus pneumoniae 

<400> 142 

Met Ala Asn Lys Gln Asp Leu Ile Ala Lys Val Ala Glu Ala Thr Glu
1 5 10 15

Leu Thr Lys Lys Asp Ser Ala Ala Ala Val Glu Ala Val Phe Ala Ala
20 25 30

Val Ala Asp Tyr Leu Ala Ala Gly Glu Lys Val Gln Leu Ile Gly Phe
35 40 45

Ser Asn Phe Glu Val Arg Glu Arg Ala Glu Arg Lys Gly Arg Asn Pro
50 55 60

Gln Thr Gly Lys Glu Met Thr Ile Ala Ala Ser Lys Val Pro Ala Phe
65 70 75 80

Lys Ala Gly Lys Ala Leu Lys Asp Ala Val Lys
85 90



<210> 143 
<211> 306 
<212> PRT

<213> Streptococcus pneumoniae



<400> 143 

Met Thr Lys Thr Ala Phe Leu Phe Ala Gly Gln Gly Ala Gln Tyr Leu
1 5 10 15

Gly Met Gly Arg Asp Phe Tyr Asp Gln Tyr Pro Ile Val Lys Glu Thr
20 25 30

Ile Asp Arg Ala Ser Gln Val Leu Gly Tyr Asp Leu Arg Tyr Leu Ile
35 40 45

Asp Thr Glu Glu Asp Lys Leu Asn Gln Thr Arg Tyr Thr Gln Pro Ala
50 55 60

Ile Leu Ala Thr Ser Val Ala Ile Tyr Arg Leu Leu Gln Glu Lys Gly
65 70 75 80

Tyr Gln Pro Asp Met Val Ala Gly Leu Ser Leu Gly Glu Tyr Ser Ala
85 90 95

Leu Val Ala Ser Gly Ala Leu Asp Phe Glu Asp Ala Val Ala Leu Val
100 105 110

Ala Lys Arg Gly Ala Tyr Met Glu Glu Ala Ala Pro Ala Asp Ser Gly
115 120 125

Lys Met Val Ala Val Leu Asn Thr Pro Val Glu Val Ile Glu Glu Ala
130 135 140

Cys Gln Lys Ala Ser Glu Leu Gly Val Val Thr Pro Ala Asn Tyr Asn
145 150 155 160

Thr Pro Ala Gln Ile Val Ile Ala Gly Glu Val Val Ala Val Asp Arg
165 170 175

Ala Val Glu Leu Leu Gln Glu Ala Gly Ala Lys Arg Leu Ile Pro Leu
180 185 190

Lys Val Ser Gly Pro Phe His Thr Ala Leu Leu Glu Pro Ala Ser Gln
195 200 205

Lys Leu Ala Glu Thr Leu Ala Gln Val Ser Phe Ser Asp Phe Thr Cys
210 215 220

Pro Leu Val Gly Asn Thr Glu Ala Ala Val Met Gln Lys Glu Asp Ile
225 230 235 240

Ala Gln Leu Leu Thr Arg Gln Val Lys Glu Pro Val Arg Phe Tyr Glu
245 250 255

Ser Ile Gly Val Met Gln Glu Ala Gly Ile Ser Asn Phe Ile Glu Ile
260 265 270

Gly Pro Gly Lys Val Leu Ser Gly Phe Val Lys Lys Ile Asp Gln Thr
275 280 285

Ala His Leu Ala His Val Glu Asp Gln Ala Ser Leu Val Ala Leu Leu
290 295 300

Glu Lys
305



<210> 144 
<211> 243 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 144 

Met Lys Leu Glu His Lys Asn Ile Phe Ile Thr Gly Ser Ser Arg Gly
1 5 10 15

Ile Gly Leu Ala Ile Ala His Lys Phe Ala Gln Ala Gly Ala Asn Ile
20 25 30

Val Leu Asn Ser Arg Gly Ala Ile Ser Glu Glu Leu Leu Ala Glu Phe
35 40 45

Ser Asn Tyr Gly Ile Lys Val Val Pro Ile Ser Gly Asp Val Ser Asp
50 55 60

Phe Ala Asp Ala Lys Arg Met Ile Asp Gln Ala Ile Ala Glu Leu Gly
65 70 75 80

Ser Val Asp Val Leu Val Asn Asn Ala Gly Ile Thr Gln Asp Thr Leu
85 90 95

Met Leu Lys Met Thr Glu Ala Asp Phe Glu Lys Val Leu Lys Val Asn
100 105 110

Leu Thr Gly Ala Phe Asn Met Thr Gln Ser Val Leu Lys Pro Met Met
115 120 125

Lys Ala Arg Glu Gly Ala Ile Ile Asn Met Ser Ser Val Val Gly Leu
130 135 140

Met Gly Asn Ile Gly Gln Ala Asn Tyr Ala Ala Ser Lys Ala Gly Leu
145 150 155 160

Ile Gly Phe Thr Lys Ser Val Ala Arg Glu Val Ala Ser Arg Asn Ile
165 170 175

Arg Val Asn Val Ile Ala Pro Gly Met Ile Glu Ser Asp Met Thr Ala
180 185 190

Ile Leu Ser Asp Lys Ile Lys Glu Ala Thr Leu Ala Gln Ile Pro Met
195 200 205

Lys Glu Phe Gly Gln Ala Glu Gln Val Ala Asp Leu Thr Val Phe Leu
210 215 220

Ala Gly Gln Asp Tyr Leu Thr Gly Gln Val Val Ala Ile Asp Gly Gly
225 230 235 240

Leu Ser Met



<210> 145

<211> 276

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 145 

Met Gly Val Lys Lys Lys Leu Lys Leu Thr Ser Leu Leu Gly Leu Ser
1 5 10 15

Leu Leu Ile Met Thr Ala Cys Ala Thr Asn Gly Val Thr Ser Asp Ile
20 25 30

Thr Ala Glu Ser Ala Asp Phe Trp Ser Lys Leu Val Tyr Phe Phe Ala
35 40 45

Glu Ile Ile Arg Phe Leu Ser Phe Asp Ile Ser Ile Gly Val Gly Ile
50 55 60

Ile Leu Phe Thr Val Leu Ile Arg Thr Val Leu Leu Pro Val Phe Gln
65 70 75 80

Val Gln Met Val Ala Ser Arg Lys Met Gln Glu Ala Gln Pro Arg Ile
85 90 95

Lys Ala Leu Arg Glu Gln Tyr Pro Gly Arg Asp Met Glu Ser Arg Thr
100 105 110

Lys Leu Glu Gln Glu Met Arg Lys Val Phe Lys Glu Met Gly Val Arg
115 120 125

Gln Ser Asp Ser Leu Trp Pro Ile Leu Ile Gln Met Pro Val Ile Leu
130 135 140

Ala Leu Phe Gln Ala Leu Ser Arg Val Asp Phe Leu Lys Thr Gly His
145 150 155 160

Phe Leu Trp Ile Asn Leu Gly Ser Val Asp Thr Thr Leu Val Leu Pro
165 170 175

Ile Leu Ala Ala Val Phe Thr Phe Leu Ser Thr Trp Leu Ser Asn Lys
180 185 190

Ala Leu Ser Glu Arg Asn Gly Ala Thr Thr Ala Met Met Tyr Gly Ile
195 200 205

Pro Val Leu Ile Phe Ile Phe Ala Val Tyr Ala Pro Gly Gly Val Ala
210 215 220

Leu Tyr Trp Thr Val Ser Asn Ala Tyr Gln Val Leu Gln Thr Tyr Phe
225 230 235 240

Leu Asn Asn Pro Phe Lys Ile Ile Ala Glu Arg Glu Ala Val Val Gln
245 250 255

Ala Gln Lys Asp Leu Glu Asn Arg Lys Arg Lys Ala Lys Lys Lys Ala
260 265 270

Gln Lys Thr Lys
275



<210> 146 
<211> 409 
<212> PRT

<213> Streptococcus pneumoniae

<400> 146 

Met Lys Ile Ser Lys Arg His Leu Leu Asn Tyr Ser Ile Leu Ile Pro
1 5 10 15

Tyr Leu Leu Leu Ser Ile Leu Gly Leu Ile Val Val Tyr Ser Thr Thr
20 25 30

Ser Ala Ile Leu Ile Glu Glu Gly Lys Ser Ala Leu Gln Leu Val Arg
35 40 45

Asn Gln Gly Ile Phe Trp Ile Val Ser Leu Ile Leu Ile Ala Leu Ile
50 55 60

Tyr Lys Leu Arg Leu Asp Phe Leu Arg Asn Glu Arg Leu Ile Ile Leu
65 70 75 80

Val Ile Leu Ile Glu Met Leu Leu Leu Phe Leu Ala Arg Phe Ile Gly
85 90 95

Ile Ser Val Asn Gly Ala Tyr Gly Trp Ile Ser Val Ala Gly Val Thr
100 105 110

Ile Gln Pro Ala Glu Tyr Leu Lys Ile Ile Ile Ile Trp Tyr Leu Ala
115 120 125

His Arg Phe Ser Lys Gln Gln Glu Glu Ile Ala Thr Tyr Asp Phe Gln
130 135 140

Val Leu Thr Gln Asn Gln Trp Leu Pro Arg Ala Phe Asn Asp Trp Arg
145 150 155 160

Phe Val Leu Leu Val Leu Ile Gly Ser Leu Gly Ile Phe Pro Asp Leu
165 170 175

Gly Asn Ala Thr Ile Leu Val Leu Val Ser Leu Ile Met Tyr Thr Val
180 185 190

Ser Gly Ile Ala Tyr Arg Trp Phe Ser Thr Ile Leu Ala Leu Val Ser
195 200 205

Ala Thr Ser Val Phe Val Leu Thr Thr Ile Ser Leu Ile Gly Val Glu
210 215 220

Thr Phe Ser Lys Ile Pro Val Phe Gly Tyr Val Ala Lys Arg Phe Ser
225 230 235 240

Ala Phe Phe Asn Pro Phe Ala Asp Arg Ala Asp Ala Gly His Gln Leu
245 250 255

Ala Asn Ser Tyr Phe Ala Met Val Asn Gly Gly Trp Phe Gly Leu Gly
260 265 270

Leu Gly Asn Ser Ile Glu Lys Arg Gly Tyr Leu Pro Glu Ala His Thr
275 280 285

Asp Phe Val Phe Ser Ile Val Ile Glu Glu Phe Gly Phe Val Gly Ala
290 295 300

Ser Leu Ile Leu Ala Leu Leu Phe Phe Met Ile Leu Arg Ile Ile Leu
305 310 315 320

Val Gly Ile Arg Ala Glu Asn Pro Phe Asn Ala Met Val Ala Leu Gly
325 330 335

Val Gly Gly Met Met Leu Val Gln Val Phe Val Asn Ile Gly Gly Ile
340 345 350

Ser Gly Leu Ile Pro Ser Thr Gly Val Thr Phe Pro Phe Leu Ser Gln
355 360 365

Gly Gly Asn Ser Leu Leu Val Leu Ser Val Ala Val Ala Phe Val Leu
370 375 380

Asn Ile Asp Ala Ser Glu Lys Arg Ala Lys Leu Tyr Arg Glu Leu Glu
385 390 395 400

Asn Gln Pro Met Asn Leu Leu Leu Lys
405

<210> 147 
<211> 419 
<212> PRT 

<213> Streptococcus pneumoniae 

<400> 147 

Met Leu Gly Ile Leu Thr Phe Ile Leu Val Phe Gly Ile Ile Val Val
1 5 10 15

Val His Glu Phe Gly His Phe Tyr Phe Ala Lys Lys Ser Gly Ile Leu
20 25 30

Val Arg Glu Phe Ala Ile Gly Met Gly Pro Lys Ile Phe Ala His Ile
35 40 45

Gly Lys Asp Gly Thr Ala Tyr Thr Ile Arg Ile Leu Pro Leu Gly Gly
50 55 60

Tyr Val Arg Met Ala Gly Trp Gly Asp Asp Thr Thr Glu Ile Lys Thr
65 70 75 80

Gly Thr Pro Val Ser Leu Thr Leu Ala Asp Asp Gly Lys Val Lys Arg
85 90 95

Ile Asn Leu Ser Gly Lys Lys Leu Asp Gln Thr Ala Leu Pro Met Gln
100 105 110

Val Thr Gln Phe Asp Phe Glu Asp Lys Leu Phe Ile Lys Gly Leu Val
115 120 125

Leu Glu Glu Glu Lys Thr Phe Ala Val Asp His Asp Ala Thr Val Val
130 135 140

Glu Ala Asp Gly Thr Glu Val Arg Ile Ala Pro Leu Asp Val Gln Tyr
145 150 155 160

Gln Asn Ala Thr Ile Trp Gly Lys Leu Ile Thr Asn Phe Ala Gly Pro
165 170 175

Met Asn Asn Phe Ile Leu Gly Val Val Val Phe Trp Val Leu Ile Phe
180 185 190

Met Gln Gly Gly Val Arg Asp Val Asp Thr Asn Gln Phe His Ile Met
195 200 205

Pro Gln Gly Ala Leu Ala Lys Val Gly Val Pro Glu Thr Ala Gln Ile
210 215 220

Thr Lys Ile Gly Ser His Glu Val Ser Asn Trp Glu Ser Leu Ile Gln
225 230 235 240

Ala Val Glu Thr Glu Thr Lys Asp Lys Thr Ala Pro Thr Leu Asp Val
245 250 255

Thr Ile Ser Glu Lys Gly Ser Asp Lys Gln Val Thr Val Thr Pro Glu
260 265 270

Asp Ser Gln Gly Arg Tyr Leu Leu Gly Val Gln Pro Gly Val Lys Ser
275 280 285

Asp Phe Leu Ser Met Phe Val Gly Gly Phe Thr Thr Ala Ala Asp Ser
290 295 300

Ala Leu Arg Ile Leu Ser Ala Leu Lys Asn Leu Ile Phe Gln Pro Asp
305 310 315 320

Leu Asn Lys Leu Gly Gly Pro Val Ala Ile Phe Lys Ala Ser Ser Asp
325 330 335

Ala Ala Lys Asn Gly Ile Glu Asn Ile Leu Tyr Phe Leu Ala Met Ile
340 345 350

Ser Ile Asn Ile Gly Ile Phe Asn Leu Ile Pro Ile Pro Ala Leu Asp
355 360 365

Gly Gly Lys Ile Val Leu Asn Ile Leu Glu Ala Ile Arg Arg Lys Pro
370 375 380

Leu Lys Gln Glu Ile Glu Thr Tyr Val Thr Leu Ala Gly Val Val Ile
385 390 395 400

Met Val Val Leu Met Ile Ala Val Thr Trp Asn Asp Ile Met Arg Leu
405 410 415

Phe Phe Arg



<210> 148
<211> 197
<212> PRT

<213> Streptococcus pneumoniae
<400> 148

Met Tyr Ala Tyr Leu Lys Gly Ile Ile Thr Lys Ile Thr Ala Lys Tyr
1 5 10 15

Ile Val Leu Glu Thr Asn Gly Ile Gly Tyr Ile Leu His Val Ala Asn
20 25 30

Pro Tyr Ala Tyr Ser Gly Gln Val Asn Gln Glu Ala Gln Ile Tyr Val
35 40 45

His Gln Val Val Arg Glu Asp Ala His Leu Leu Tyr Gly Phe Arg Ser
50 55 60

Glu Asp Glu Lys Lys Leu Phe Leu Ser Leu Ile Ser Val Ser Gly Ile
65 70 75 80

Gly Pro Val Ser Ala Leu Ala Ile Ile Ala Ala Asp Asp Asn Ala Gly
85 90 95

Leu Val Gln Ala Ile Glu Thr Lys Asn Ile Thr Tyr Leu Thr Lys Phe
100 105 110

Pro Lys Ile Gly Lys Lys Thr Ala Gln Gln Met Val Leu Asp Leu Glu
115 120 125

Gly Lys Val Val Val Ala Gly Asp Asp Leu Pro Ala Lys Val Ala Val
130 135 140

Gln Ala Ser Ala Glu Asn Gln Glu Leu Glu Glu Ala Met Glu Ala Met
145 150 155 160

Leu Ala Leu Gly Tyr Lys Ala Thr Glu Leu Lys Lys Ile Lys Lys Phe
165 170 175

Phe Glu Gly Thr Thr Asp Thr Ala Glu Asn Tyr Ile Lys Ser Ala Leu
180 185 190

Lys Met Leu Val Lys
195

<210> 149 
<211> 257
<212> PRT

<213> Streptococcus pneumoniae 
<400> 149 

Met Lys Asn Asn Arg Ile Leu Ala Leu Ser Gly Asn Asp Ile Phe Ser
1 5 10 15

Gly Gly Gly Leu Ser Ala Asp Leu Ala Thr Tyr Thr Leu Asn Gly Leu
20 25 30

His Gly Phe Val Ala Val Thr Cys Leu Thr Ala Leu Thr Glu Lys Gly
35 40 45

Phe Glu Val Phe Pro Thr Asp Asp Thr Ile Phe Gln His Glu Leu Asp
50 55 60

Ser Leu Arg Asp Val Glu Phe Gly Gly Ile Lys Ile Gly Leu Leu Pro
65 70 75 80

Thr Val Ser Val Ala Glu Lys Ala Leu Asp Phe Ile Lys Gln Arg Pro
85 90 95

Gly Val Pro Val Val Leu Asp Pro Val Leu Val Cys Lys Glu Thr His
100 105 110

Asp Val Ala Val Ser Glu Leu Cys Gln Glu Leu Ile Arg Phe Phe Pro
115 120 125

Tyr Val Ser Val Ile Thr Pro Asn Leu Pro Glu Ala Glu Leu Leu Ser
130 135 140

Gly Gln Glu Ile Lys Thr Leu Glu Asp Met Lys Thr Ala Ala Gln Lys
145 150 155 160

Leu His Asp Leu Gly Ala Pro Ala Val Ile Ile Lys Gly Gly Asn Arg
165 170 175

Leu Ser Gln Asp Lys Ala Val Asp Val Phe Tyr Asp Gly Gln Thr Phe
180 185 190

Thr Ile Leu Glu Asn Pro Val Ile Gln Gly Gln Asn Ala Gly Ala Gly
195 200 205

Cys Thr Phe Ala Ser Ser Ile Ala Ser His Leu Val Lys Gly Asp Lys
210 215 220

Phe Leu Pro Ala Val Glu Ser Ser Lys Ala Phe Val Tyr Arg Ala Ile
225 230 235 240

Ala Gln Ala Asp Gln Tyr Gly Val Arg Gln Tyr Glu Ala Asn Lys Asn
245 250 255

Asn



<210> 150
<211> 412
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 150 

Met Ile Glu Thr Glu Lys Lys Glu Glu Arg Val Leu Leu Ile Gly Val
1 5 10 15

Glu Leu Gln Gly Met Asp Ser Phe Asp Leu Ser Met Glu Glu Leu Ala
20 25 30

Ser Leu Ala Lys Thr Ala Gly Ala Val Val Val Asp Ser Tyr Arg Gln
35 40 45

Lys Arg Glu Lys Tyr Asp Ser Lys Thr Phe Val Gly Ser Gly Lys Leu
50 55 60

Glu Glu Ile Ala Leu Met Val Asp Ala Glu Glu Ile Thr Thr Val Ile
65 70 75 80

Val Asn Asn Arg Leu Thr Pro Arg Gln Asn Val Asn Leu Glu Glu Val
85 90 95

Leu Gly Val Lys Val Ile Asp Arg Met Gln Leu Ile Leu Asp Ile Phe
100 105 110

Ala Met Arg Ala Arg Ser His Glu Gly Lys Leu Gln Val His Leu Ala
115 120 125

Gln Phe Lys Tyr Leu Leu Pro Arg Leu Val Gly Gln Gly Ile Met Leu
130 135 140

Ser Arg Gln Ala Gly Gly Ile Gly Ser Arg Gly Pro Gly Glu Ser Gln
145 150 155 160

Leu Glu Leu Asn Arg Arg Ser Val Arg Asn Gln Ile Thr Asp Ile Glu
165 170 175

Arg Gln Leu Lys Val Val Glu Lys Asn Arg Ala Thr Val Arg Glu Lys
180 185 190

Arg Leu Glu Ser Ser Thr Phe Lys Ile Gly Leu Ile Gly Tyr Thr Asn
195 200 205

Ala Gly Lys Ser Thr Ile Met Asn Ile Leu Thr Ser Lys Thr Gln Tyr
210 215 220

Glu Ala Asp Glu Leu Phe Ala Thr Leu Asp Ala Thr Thr Lys Ser Ile
225 230 235 240

His Leu Gly Gly Asn Leu Gln Val Thr Leu Thr Asp Thr Val Gly Phe
245 250 255

Ile Gln Asp Leu Pro Thr Glu Leu Val Ser Ser Phe Lys Ser Thr Leu
260 265 270

Glu Glu Ser Lys His Val Asp Leu Leu Val His Val Ile Asp Ala Ser
275 280 285

Asn Pro Tyr His Glu Glu His Glu Lys Thr Val Leu Ser Ile Met Lys
290 295 300

Asp Leu Asp Met Glu Asp Ile Pro His Leu Thr Leu Tyr Asn Lys Ala
305 310 315 320

Asp Leu Val Glu Asp Phe Thr Pro Thr Gln Thr Pro Tyr Thr Leu Ile
325 330 335

Ser Ala Lys Ser Glu Asp Ser Arg Glu Asn Leu Gln Ala Leu Leu Leu
340 345 350

Asp Lys Ile Lys Glu Ile Phe Glu Ala Phe Thr Leu Arg Val Pro Phe
355 360 365

Ser Lys Ser Tyr Lys Ile His Asp Leu Glu Ser Val Ala Ile Leu Glu
370 375 380

Glu Arg Asp Tyr Gln Glu Asp Gly Glu Val Ile Thr Gly Tyr Ile Ser
385 390 395 400

Glu Lys Asn Lys Trp Arg Leu Glu Glu Phe Tyr Asp
405 410



<210> 151 
<211> 160 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 151 

Met Ala Glu Lys Thr Tyr Pro Met Thr Leu Glu Glu Lys Glu Lys Leu 
1 5 10 15 

Glu Lys Glu Leu Glu Glu Leu Lys Leu Val Arg Arg Pro Glu Val Val 
20 25 30 

Glu Arg Ile Lys Ile Ala Arg Ser Tyr Gly Asp Leu Ser Glu Asn Ser
35 40 45

Glu Tyr Glu Ala Ala Lys Asp Glu Gln Ala Phe Val Glu Gly Gln Ile
50 55 60

Ser Ser Leu Glu Thr Lys Ile Arg Tyr Ala Glu Ile Val Asn Ser Asp
65 70 75 80

Ala Val Ala Gln Asp Glu Val Ala Ile Gly Lys Thr Val Thr Ile Gln
85 90 95

Glu Ile Gly Glu Asp Glu Glu Glu Val Tyr Ile Ile Val Gly Ser Ala
100 105 110

Gly Ala Asp Ala Phe Ala Gly Lys Val Ser Asn Glu Ser Pro Ile Gly
115 120 125

Gln Ala Leu Ile Gly Lys Lys Thr Gly Asp Thr Ala Thr Ile Glu Thr
130 135 140

Pro Val Gly Ser Tyr Asp Val Lys Ile Leu Lys Val Glu Lys Thr Ala
145 150 155 160

<210> 152 
<211> 189 
<212> PRT 

<213> Streptococcus pneumoniae 

<400> 152 

Met Thr Lys Leu Leu Val Gly Leu Gly Asn Pro Gly Asp Lys Tyr Phe
1 5 10 15

Glu Thr Lys His Asn Val Gly Phe Met Leu Ile Asp Gln Leu Ala Lys
20 25 30

Lys Gln Asn Val Thr Phe Thr His Asp Lys Ile Phe Gln Ala Asp Leu
35 40 45

Ala Ser Phe Phe Leu Asn Gly Glu Lys Ile Tyr Leu Val Lys Pro Thr
50 55 60

Thr Phe Met Asn Glu Ser Gly Lys Ala Val His Ala Leu Leu Thr Tyr
65 70 75 80

Tyr Gly Leu Asp Ile Asp Asp Leu Leu Ile Ile Tyr Asp Asp Leu Asp
85 90 95

Met Glu Val Gly Lys Ile Arg Leu Arg Ala Lys Gly Ser Ala Gly Gly
100 105 110

His Asn Gly Ile Lys Ser Ile Ile Gln His Ile Gly Thr Gln Val Phe
115 120 125

Asn Arg Val Lys Ile Gly Ile Gly Arg Pro Lys Asn Gly Met Ser Val
130 135 140

Val His His Val Leu Ser Lys Phe Asp Arg Asp Glu Tyr Ile Gly Ile
145 150 155 160

Leu Gln Ser Val Asp Lys Val Asp Asp Ser Val Asn Tyr Tyr Leu Gln
165 170 175

Glu Lys Asn Phe Glu Lys Thr Met Gln Arg Tyr Asn Gly
180 185



<210> 153

<211> 283

<212> PRT

<213> Streptococcus pneumoniae

<400> 153

Met Ile Leu Ile Thr Gly Ala Asn Gly Gln Leu Gly Thr Glu Leu Arg
1 5 10 15

Tyr Leu Leu Asp Glu Arg Asn Glu Glu Tyr Val Ala Val Asp Val Ala
20 25 30

Lys Met Asp Ile Thr Asn Glu Glu Met Val Glu Lys Val Phe Glu Glu
35 40 45

Val Lys Pro Thr Leu Val Tyr His Cys Ala Ala Tyr Thr Ala Val Asp
50 55 60

Ala Ala Glu Asp Glu Gly Lys Glu Leu Asp Phe Ala Ile Asn Val Thr
65 70 75 80

Gly Thr Lys Asn Val Ala Lys Ala Ser Glu Lys His Gly Ala Thr Leu
85 90 95

Val Tyr Ile Ser Thr Asp Tyr Val Phe Asp Gly Lys Lys Pro Val Gly
100 105 110

Gln Glu Trp Glu Val Asp Asp Arg Pro Asp Pro Gln Thr Glu Tyr Gly
115 120 125

Arg Thr Lys Arg Met Gly Glu Glu Leu Val Glu Lys His Val Ser Asn
130 135 140

Phe Tyr Ile Ile Arg Thr Ala Trp Val Phe Gly Asn Tyr Gly Lys Asn
145 150 155 160

Phe Val Phe Thr Met Gln Asn Leu Ala Lys Thr His Lys Thr Leu Thr
165 170 175

Val Val Asn Asp Gln Tyr Gly Arg Pro Thr Trp Thr Arg Thr Leu Ala
180 185 190

Glu Phe Met Thr Tyr Leu Ala Glu Asn Arg Lys Glu Phe Gly Tyr Tyr
195 200 205

His Leu Ser Asn Asp Ala Thr Glu Asp Thr Thr Trp Tyr Asp Phe Ala
210 215 220

Val Glu Ile Leu Lys Asp Thr Asp Val Glu Val Lys Pro Val Asp Ser
225 230 235 240

Ser Gln Phe Pro Ala Lys Ala Lys Arg Pro Leu Asn Ser Thr Met Ser
245 250 255

Leu Ala Lys Ala Lys Ala Thr Gly Phe Val Ile Pro Thr Trp Gln Asp
260 265 270

Ala Leu Gln Glu Phe Tyr Lys Gln Glu Val Arg
275 280



<210> 154

<211> 407

<212> PRT

<213> Streptococcus pneumoniae 

<400> 154 

Met Lys Arg Ser Leu Asp Ser Arg Val Asp Tyr Ser Leu Leu Leu Pro
1 5 10 15

Val Phe Phe Leu Leu Val Ile Gly Val Val Ala Ile Tyr Ile Ala Val
20 25 30

Ser His Asp Tyr Pro Asn Asn Ile Leu Pro Ile Leu Gly Gln Gln Val
35 40 45

Ala Trp Ile Ala Leu Gly Leu Val Ile Gly Phe Val Val Met Leu Phe
50 55 60

Asn Thr Glu Phe Leu Trp Lys Val Thr Pro Phe Leu Tyr Ile Leu Gly
65 70 75 80

Leu Gly Leu Met Ile Leu Pro Ile Val Phe Tyr Asn Pro Ser Leu Val
85 90 95

Ala Ser Thr Gly Ala Lys Asn Trp Val Ser Ile Asn Gly Ile Thr Leu
100 105 110

Phe Gln Pro Ser Glu Phe Met Lys Ile Ser Tyr Ile Leu Met Leu Ala
115 120 125

Arg Val Ile Val Gln Phe Thr Lys Lys His Lys Glu Trp Arg Arg Thr
130 135 140

Val Pro Leu Asp Phe Leu Leu Ile Phe Trp Met Ile Leu Phe Thr Ile
145 150 155 160

Pro Val Leu Val Leu Leu Ala Leu Gln Ser Asp Leu Gly Thr Ala Leu
165 170 175

Val Phe Val Ala Ile Phe Ser Gly Ile Val Leu Leu Ser Gly Val Ser
180 185 190

Trp Lys Ile Ile Ile Pro Val Phe Val Thr Ala Val Thr Gly Val Ala
195 200 205

Gly Phe Leu Ala Ile Phe Ile Ser Lys Asp Gly Arg Ala Phe Leu His
210 215 220

Gln Ile Gly Met Pro Thr Tyr Gln Ile Asn Arg Ile Leu Ala Trp Leu
225 230 235 240

Asn Pro Phe Glu Phe Ala Gln Thr Thr Thr Tyr Gln Gln Ala Gln Gly
245 250 255

Gln Ile Ala Ile Gly Ser Gly Gly Leu Phe Gly Gln Gly Phe Asn Ala
260 265 270

Ser Asn Leu Leu Ile Pro Val Arg Glu Ser Asp Met Ile Phe Thr Val
275 280 285

Ile Ala Glu Asp Phe Gly Phe Ile Gly Ser Val Leu Val Ile Ala Leu
290 295 300

Tyr Leu Met Leu Ile Tyr Arg Met Leu Lys Ile Thr Leu Lys Ser Asn
305 310 315 320

Asn Gln Phe Tyr Thr Tyr Ile Ser Thr Gly Leu Ile Met Met Leu Leu
325 330 335

Phe His Ile Phe Glu Asn Ile Gly Ala Val Thr Gly Leu Leu Pro Leu
340 345 350

Thr Gly Ile Pro Leu Pro Phe Ile Ser Gln Gly Gly Ser Ala Ile Ile
355 360 365

Ser Asn Leu Ile Gly Val Gly Leu Leu Leu Ser Met Ser Tyr Gln Thr
370 375 380

Asn Leu Ala Glu Glu Lys Ser Gly Lys Val Pro Phe Lys Arg Lys Lys
385 390 395 400

Val Val Leu Lys Gln Ile Lys
405



<210> 155
<211> 202
<212> PRT

<213> Streptococcus pneumoniae

<400> 155 

Met Gly Lys Ile Ile Gly Ile Thr Gly Gly Ile Ala Ser Gly Lys Ser
1 5 10 15

Thr Val Thr Asn Phe Leu Lys His Gln Gly Leu Ser Ser Ser Gly Leu
20 25 30

Pro Thr Gln Cys Ser Thr Asn Tyr Arg Lys Pro Gly Gly Arg Leu Phe
35 40 45

Glu Ala Leu Val Gln His Phe Gly Gln Glu Ile Ile Leu Glu Asn Gly
50 55 60

Glu Leu Asn Arg Pro Leu Ile Ala Ser Leu Ile Phe Ser Asn Pro Glu
65 70 75 80

Glu Gln Lys Trp Ser Asn Gln Ile Gln Gly Glu Ile Ile Arg Glu Glu
85 90 95

Leu Ala Thr Leu Arg Glu Gln Leu Ala Gln Thr Glu Glu Ile Phe Phe
100 105 110

Met Asp Ile Pro Leu Leu Phe Glu Gln Asp Tyr Ser Asp Trp Phe Ala
115 120 125

Glu Thr Trp Leu Val Tyr Val Asp Arg Asp Ala Gln Val Glu Arg Leu
130 135 140

Met Lys Arg Asp Gln Leu Ser Lys Asp Glu Ala Glu Ser Arg Met Ala
145 150 155 160

Ala Gln Trp Pro Leu Glu Lys Lys Lys Asp Leu Ala Ser Gln Val Leu
165 170 175

Asp Asn Asn Gly Asn Gln Asn Gln Leu Leu Asn Gln Val His Ile Leu
180 185 190

Leu Glu Gly Gly Arg Gln Asp Asp Arg Asp
195 200



<210> 156
<211> 419
<212> PRT

<213> Streptococcus pneumoniae
<400> 156

Met Arg Lys Ile Val Ile Asn Gly Gly Leu Pro Leu Gln Gly Glu Ile
1 5 10 15

Thr Ile Ser Gly Ala Lys Asn Ser Val Val Ala Leu Ile Pro Ala Ile
20 25 30

Ile Leu Ala Asp Asp Val Val Thr Leu Asp Cys Val Pro Asp Ile Ser
35 40 45

Asp Val Ala Ser Leu Val Glu Ile Met Glu Leu Met Gly Ala Thr Val
50 55 60

Lys Arg Tyr Asp Asp Val Leu Glu Ile Asp Pro Arg Gly Val Gln Asn
65 70 75 80

Ile Pro Met Pro Tyr Gly Lys Ile Asn Ser Leu Arg Ala Ser Tyr Tyr
85 90 95

Phe Tyr Gly Ser Leu Leu Gly Arg Phe Gly Glu Ala Thr Val Gly Leu
100 105 110

Pro Gly Gly Cys Asp Leu Gly Pro Arg Pro Ile Asp Leu His Leu Lys
115 120 125

Ala Phe Glu Ala Met Gly Ala Thr Ala Ser Tyr Glu Gly Asp Asn Met
130 135 140

Lys Leu Ser Ala Lys Asp Thr Gly Leu His Gly Ala Ser Ile Tyr Met
145 150 155 160

Asp Thr Val Ser Val Gly Ala Thr Ile Asn Thr Met Ile Ala Ala Val
165 170 175



96 



WO 01/49721 PCT/US00/35604



Lys Ala Asn Gly Arg Thr Ile Ile Glu Asn Ala Ala Arg Glu Pro Glu
180 185 190

Ile Ile Asp Val Ala Thr Leu Leu Asn Asn Met Gly Ala His Ile Arg
195 200 205

Gly Ala Gly Thr Asn Ile Ile Ile Ile Asp Gly Val Glu Arg Leu His
210 215 220

Gly Thr Arg His Gln Val Ile Pro Asp Arg Ile Glu Ala Gly Thr Tyr
225 230 235 240

Ile Ser Leu Ala Ala Ala Val Gly Lys Gly Ile Arg Ile Asn Asn Val
245 250 255

Leu Tyr Glu His Leu Glu Gly Phe Ile Ala Lys Leu Glu Glu Met Gly
260 265 270

Val Arg Met Thr Val Ser Glu Asp Ser Ile Phe Val Glu Glu Gln Ser
275 280 285

Asn Leu Lys Ala Ile Asn Ile Lys Thr Ala Pro Tyr Pro Gly Phe Ala
290 295 300

Thr Asp Leu Gln Gln Pro Leu Thr Pro Leu Leu Leu Arg Ala Asn Gly
305 310 315 320

Arg Gly Thr Ile Val Asp Thr Ile Tyr Glu Lys Arg Val Asn His Val
325 330 335

Phe Glu Leu Ala Lys Met Asp Ala Asp Ile Ser Thr Thr Asn Gly His
340 345 350

Ile Leu Tyr Thr Gly Gly Arg Asp Leu Arg Gly Ala Ser Val Lys Ala
355 360 365

Thr Asp Leu Arg Ala Gly Ala Ala Leu Val Ile Ala Gly Leu Met Ala
370 375 380

Glu Gly Lys Thr Glu Ile Thr Asn Ile Glu Phe Ile Leu Arg Gly Tyr
385 390 395 400

Ser Asp Ile Ile Glu Lys Leu Arg Asn Leu Gly Ala Asp Ile Arg Leu
405 410 415

Val Glu Asp



<210> 157 
<211> 231 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 157 

Met Ser Arg Ile Glu Phe Ser Pro Ser Leu Met Thr Met Asp Leu Asp
1 5 10 15

Lys Phe Lys Glu Gln Ile Thr Phe Leu Asn Asp Lys Val Ala Ser Tyr
20 25 30

His Ile Asp Ile Met Asp Gly His Phe Val Pro Asn Ile Thr Leu Ser
35 40 45

Pro Trp Phe Ile Gln Glu Val Gln Lys Ile Ser Asp Thr Pro Leu Ser
50 55 60

Val His Leu Met Val Thr Asp Pro Thr Phe Trp Val Asp Gln Val Leu
65 70 75 80

Asp Leu Gln Cys Glu Tyr Ile Cys Ile His Ala Glu Val Leu Asn Gly
85 90 95

Leu Ala Phe Arg Leu Ile Asp Lys Ile His Asp Ala Gly Leu Lys Ala
100 105 110

Gly Val Val Leu Asn Pro Glu Thr Pro Val Ser Thr Ile Phe Pro Tyr
115 120 125

Ile Asp Leu Leu Asp Lys Val Thr Ile Met Thr Val Asp Pro Gly Phe
130 135 140

Ala Gly Gln Arg Phe Leu Glu Ser Thr Leu Tyr Lys Ile Gln Glu Leu
145 150 155 160

Arg Gln Leu Arg Val Gln Asn Gly Tyr His Tyr Ile Ile Glu Met Asp
165 170 175

Gly Ser Ser Ser Arg Lys Thr Phe Lys Gln Ile Asp Val Ala Gly Pro
180 185 190

Asp Ile Tyr Val Ile Gly Arg Ser Gly Leu Phe Gly Leu Asp Asp Asp
195 200 205

Ile Ala Lys Ala Trp Asp Ile Cys Ser Arg Asp Tyr Glu Glu Met Thr
210 215 220

Gly Lys Thr Met Pro Ile Lys
225 230

<210> 158 
<211> 374 
<212> PRT 

<213> Streptococcus pneumoniae 

<400> 158 

Met Arg Asn Met Ala Leu Thr Ala Gly Ile Val Gly Leu Pro Asn Val
1 5 10 15

Gly Lys Ser Thr Leu Phe Asn Ala Ile Thr Lys Ala Gly Ala Glu Ala
20 25 30

Ala Asn Tyr Pro Phe Ala Thr Ile Asp Pro Asn Val Gly Met Val Glu
35 40 45

Asp Pro Asp Glu Arg Leu Gln Lys Leu Thr Glu Met Ile Thr Pro Lys
50 55 60

Lys Thr Val Pro Thr Thr Phe Glu Phe Thr Asp Ile Ala Gly Ile Val
65 70 75 80

Lys Gly Ala Ser Lys Gly Glu Gly Leu Gly Asn Lys Phe Leu Ala Asn
85 90 95

Ile Arg Glu Val Asp Ala Ile Val His Val Val Arg Ala Phe Asp Asp
100 105 110

Glu Asn Val Met Arg Glu Gln Gly Arg Glu Asp Ala Phe Val Asp Pro
115 120 125

Leu Ala Asp Ile Asp Thr Ile Asn Leu Glu Leu Ile Leu Ala Asp Leu
130 135 140

Glu Ser Val Asn Lys Arg Tyr Ala Arg Val Glu Lys Met Ala Arg Thr
145 150 155 160

Gln Lys Asp Lys Glu Ser Val Ala Glu Phe Asn Val Leu Gln Lys Ile
165 170 175

Lys Pro Val Leu Glu Asp Gly Lys Ser Ala Arg Thr Ile Glu Phe Thr
180 185 190

Asp Glu Glu Gln Lys Val Val Lys Gly Leu Phe Leu Leu Thr Thr Lys
195 200 205

Pro Val Leu Tyr Val Ala Asn Val Asp Glu Asp Val Val Ser Glu Pro
210 215 220

Asp Ser Ile Asp Tyr Val Lys Gln Ile Arg Glu Phe Ala Ala Thr Glu
225 230 235 240

Asn Ala Glu Val Val Val Ile Ser Ala Arg Ala Glu Glu Glu Ile Ser
245 250 255

Glu Leu Asp Asp Glu Asp Lys Lys Glu Phe Leu Glu Ala Ile Gly Leu
260 265 270

Thr Glu Ser Gly Val Asp Lys Leu Thr Arg Ala Ala Tyr His Leu Leu
275 280 285

Gly Leu Gly Thr Tyr Phe Thr Ala Gly Glu Lys Glu Val Arg Ala Trp
290 295 300

Thr Phe Lys Arg Gly Met Lys Ala Pro Gln Ala Ala Gly Ile Ile His
305 310 315 320

Ser Asp Phe Glu Lys Gly Phe Ile Arg Ala Val Thr Met Ser Tyr Glu
325 330 335

Asp Leu Val Lys Tyr Gly Ser Glu Lys Ala Val Lys Glu Ala Gly Arg
340 345 350

Leu Arg Glu Glu Gly Lys Glu Tyr Ile Val Gln Asp Gly Asp Ile Met
355 360 365

Glu Phe Arg Phe Asn Val
370

<210> 159 
<211> 110 
<212> PRT 

<213> Streptococcus pneumoniae 

<400> 159 

Met Glu Ile Glu Lys Thr Asn Arg Met Asn Ala Leu Phe Glu Phe Tyr
1 5 10 15

Ala Ala Leu Leu Thr Asp Lys Gln Met Asn Tyr Ile Glu Leu Tyr Tyr
20 25 30

Ala Asp Asp Tyr Ser Leu Ala Glu Ile Ala Glu Glu Phe Gly Val Ser
35 40 45

Arg Gln Ala Val Tyr Asp Asn Ile Lys Arg Thr Glu Lys Ile Leu Glu
50 55 60

Asp Tyr Glu Met Lys Leu His Met Tyr Ser Asp Tyr Ile Val Arg Ser
65 70 75 80

Gln Ile Phe Asp Gln Ile Leu Glu Arg Tyr Pro Lys Asp Asp Phe Leu
85 90 95

Gln Glu Gln Ile Glu Ile Leu Thr Ser Ile Asp Asn Arg Glu
100 105 110



<210> 160 
<211> 223
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 160 

Met Thr Leu Glu Trp Glu Glu Phe Leu Asp Pro Tyr Ile Gln Ala Val
1 5 10 15

Gly Glu Leu Lys Ile Lys Leu Arg Gly Ile Arg Lys Gln Tyr Arg Lys
20 25 30

Gln Asn Lys His Ser Pro Ile Glu Phe Val Thr Gly Arg Val Lys Pro
35 40 45

Ile Glu Ser Ile Lys Glu Lys Met Ala Arg Arg Gly Ile Thr Tyr Ala
50 55 60

Thr Leu Glu His Asp Leu Gln Asp Ile Ala Gly Leu Arg Val Met Val
65 70 75 80

Gln Phe Val Asp Asp Val Lys Glu Val Val Asp Ile Leu His Lys Arg
85 90 95

Gln Asp Met Arg Ile Ile Gln Glu Arg Asp Tyr Ile Thr His Arg Lys
100 105 110

Ala Ser Gly Tyr Arg Ser Tyr His Val Val Val Glu Tyr Thr Val Asp
115 120 125

Thr Ile Asn Gly Ala Lys Thr Ile Leu Ala Glu Ile Gln Ile Arg Thr
130 135 140

Leu Ala Met Asn Phe Trp Ala Thr Ile Glu His Ser Leu Asn Tyr Lys
145 150 155 160

Tyr Gln Gly Asp Phe Pro Asp Glu Ile Lys Lys Arg Leu Glu Ile Thr
165 170 175

Ala Arg Ile Ala His Gln Leu Asp Glu Glu Met Gly Glu Ile Arg Asp
180 185 190

Asp Ile Gln Glu Ala Gln Ala Leu Phe Asp Pro Leu Ser Arg Lys Leu
195 200 205

Asn Asp Gly Val Gly Asn Ser Asp Asp Thr Asp Glu Glu Tyr Arg
210 215 220



<210> 161 
<211> 195 
<212> PRT

<213> Streptococcus pneumoniae 
<400> 161 

Met Glu Leu Asn Thr His Asn Ala Glu Ile Leu Leu Ser Ala Ala Asn
1 5 10 15

Lys Ser His Tyr Pro Gln Asp Glu Leu Pro Glu Ile Ala Leu Ala Gly
20 25 30

Arg Ser Asn Val Gly Lys Ser Ser Phe Ile Asn Thr Met Leu Asn Arg
35 40 45

Lys Asn Leu Ala Arg Thr Ser Gly Lys Pro Gly Lys Thr Gln Leu Leu
50 55 60

Asn Phe Phe Asn Ile Asp Asp Lys Met Arg Phe Val Asp Val Pro Gly
65 70 75 80

Tyr Gly Tyr Ala Arg Val Ser Lys Lys Glu Arg Glu Lys Trp Gly Cys
85 90 95

Met Ile Glu Glu Tyr Leu Thr Thr Arg Glu Asn Leu Arg Ala Val Val
100 105 110

Ser Leu Val Asp Leu Arg His Asp Pro Ser Ala Asp Asp Val Gln Met
115 120 125

Tyr Glu Phe Leu Lys Tyr Tyr Glu Ile Pro Val Ile Ile Val Ala Thr
130 135 140

Lys Ala Asp Lys Ile Pro Arg Gly Lys Trp Asn Lys His Glu Ser Ala
145 150 155 160

Ile Lys Lys Lys Leu Asn Phe Asp Pro Ser Asp Asp Phe Ile Leu Phe
165 170 175

Ser Ser Val Ser Lys Ala Gly Met Asp Glu Ala Trp Asp Ala Ile Leu
180 185 190

Glu Lys Leu
195



<210> 162
<211> 97
<212> PRT

<213> Streptococcus pneumoniae
<400> 162

Met Lys Thr Arg Lys Ile Pro Leu Arg Lys Ser Val Val Ser Asn Glu
1 5 10 15

Val Ile Asp Lys Arg Asp Leu Leu Arg Ile Val Lys Asn Lys Glu Gly
20 25 30

Gln Val Phe Ile Asp Pro Thr Gly Lys Ala Asn Gly Arg Gly Ala Tyr
35 40 45

Ile Lys Leu Asp Asn Ala Glu Ala Leu Glu Ala Lys Lys Lys Lys Val
50 55 60

Phe Asn Arg Ser Phe Ser Met Glu Val Glu Glu Ser Phe Tyr Asp Glu
65 70 75 80

Leu Ile Ala Tyr Val Asp His Lys Val Lys Arg Arg Glu Leu Gly Leu
85 90 95

Glu



<210> 163
<211> 103
<212> PRT

<213> Streptococcus pneumoniae
<400> 163

Met Leu Lys Pro Ser Ile Asp Thr Leu Leu Asp Lys Val Pro Ser Lys
1 5 10 15

Tyr Ser Leu Val Ile Leu Glu Ala Lys Arg Ala His Glu Leu Glu Ala
20 25 30

Gly Ala Pro Ala Thr Gln Gly Phe Lys Ser Glu Lys Ser Thr Leu Arg
35 40 45

Ala Leu Glu Glu Ile Glu Ser Gly Asn Val Thr Ile His Pro Asp Pro
50 55 60

Glu Gly Lys Arg Glu Ala Val Arg Arg Arg Ile Glu Glu Glu Lys Arg
65 70 75 80

Arg Lys Glu Glu Glu Glu Lys Lys Ile Lys Glu Gln Ile Ala Lys Glu
85 90 95

Lys Glu Asp Gly Glu Lys Ile
100



<210> 164
<211> 103
<212> PRT

<213> Streptococcus pneumoniae
<400> 164

Met Ser Leu Thr Ser Lys Gln Arg Ala Phe Leu Asn Ser Gln Ala His
1 5 10 15

Thr Leu Lys Pro Ile Ile Gln Ile Gly Lys Asn Gly Leu Asn Asp Gln
20 25 30

Ile Lys Thr Ser Val Arg Gln Ala Leu Asp Ala Arg Glu Leu Ile Lys
35 40 45

Val Thr Leu Leu Gln Asn Thr Asp Glu Asn Ile His Glu Val Ala Glu
50 55 60

Ile Leu Glu Glu Glu Ile Gly Val Asp Thr Val Gln Lys Ile Gly Arg
65 70 75 80

Ile Leu Ile Leu Phe Lys Gln Ser Ser Lys Lys Glu Asn Arg Lys Ile
85 90 95

Ser Lys Lys Val Lys Glu Ile
100


<210> 165
<211> 175
<212> PRT

<213> Streptococcus pneumoniae
<400> 165

Met Ala Ile Glu Asn Tyr Ile Pro Asp Phe Ala Val Glu Ala Val Tyr
1 5 10 15

Asp Leu Thr Val Pro Ser Leu Gln Ala Gln Gly Ile Lys Ala Val Leu
20 25 30

Val Asp Leu Asp Asn Thr Leu Ile Ala Trp Asn Asn Pro Asp Gly Thr
35 40 45

Pro Glu Met Lys Gln Trp Leu His Asp Leu Arg Asp Ala Gly Ile Gly
50 55 60

Ile Ile Val Val Ser Asn Asn Thr Lys Lys Arg Val Gln Arg Ala Val
65 70 75 80

Glu Lys Phe Gly Ile Asp Tyr Val Tyr Trp Ala Leu Lys Pro Phe Thr
85 90 95

Phe Gly Ile Asp Arg Ala Met Lys Glu Phe His Tyr Asp Lys Lys Glu
100 105 110

Val Val Met Val Gly Asp Gln Leu Met Thr Asp Ile Arg Ala Ala His
115 120 125

Arg Ala Gly Ile Arg Ser Ile Leu Val Lys Pro Leu Val Gln His Asp
130 135 140

Ser Ile Lys Thr Gln Ile Asn Arg Thr Arg Glu Arg Arg Val Met Arg
145 150 155 160

Lys Ile Thr Glu Lys Tyr Gly Pro Ile Thr Tyr Lys Lys Gly Ile
165 170 175



<210> 166
<211> 455
<212> PRT

<213> Streptococcus pneumoniae
<400> 166

Met Phe Arg Lys Ile Leu Ile Ala Asn Arg Gly Glu Ile Ala Val Arg
1 5 10 15

Ile Ile Arg Ala Ala Arg Glu Leu Gly Ile Ala Thr Val Ala Val Tyr
20 25 30

Ser Thr Ala Asp Lys Glu Ala Leu His Thr Leu Leu Ala Asp Glu Ala
35 40 45

Val Cys Ile Gly Pro Gly Lys Ala Thr Glu Ser Tyr Leu Asn Ile Asn
50 55 60

Ala Val Leu Ser Ala Ala Val Leu Thr Glu Ala Glu Ala Ile His Pro
65 70 75 80

Gly Phe Gly Phe Leu Ser Glu Asn Ser Lys Phe Ala Thr Met Cys Glu
85 90 95

Glu Ile Gly Ile Lys Phe Ile Gly Pro Ser Gly His Val Met Asp Met
100 105 110

Met Gly Asp Lys Ile Asn Ala Arg Ala Gln Met Ile Lys Ala Gly Val
115 120 125

Pro Val Ile Pro Gly Ser Asp Gly Glu Val His Asn Ser Glu Glu Ala
130 135 140

Leu Ile Val Ala Glu Lys Ile Gly Tyr Pro Val Met Leu Lys Ala Ser
145 150 155 160

Ala Gly Gly Gly Gly Lys Gly Ile Arg Lys Val Glu Lys Pro Asp Asp
165 170 175

Leu Val Ser Ala Phe Glu Thr Ala Ser Ser Glu Ala Lys Ala Asn Tyr
180 185 190

Gly Asn Gly Ala Met Tyr Ile Glu Arg Val Ile Tyr Pro Ala Arg His
195 200 205

Ile Glu Val Gln Ile Leu Gly Asp Glu His Gly His Val Ile His Leu
210 215 220

Gly Glu Arg Asp Cys Ser Leu Gln Arg Asn Asn Gln Lys Val Leu Glu
225 230 235 240

Glu Ser Pro Ser Ile Ala Ile Gly Lys Thr Leu Arg His Glu Ile Gly
245 250 255

Ala Ala Ala Val Arg Ala Ala Glu Phe Val Gly Tyr Glu Asn Ala Gly
260 265 270

Thr Ile Glu Phe Leu Leu Asp Glu Ala Ser Ser Asn Phe Tyr Phe Met
275 280 285

Glu Met Asn Thr Arg Val Gln Val Glu His Pro Val Thr Glu Phe Val
290 295 300

Ser Gly Val Asp Ile Val Lys Glu Gln Ile Cys Ile Ala Ala Gly Gln
305 310 315 320

Pro Leu Ser Val Lys Gln Glu Asp Ile Val Leu Arg Gly His Ala Ile
325 330 335

Glu Cys Arg Ile Asn Ala Glu Asn Pro Ala Phe Asn Phe Ala Pro Ser
340 345 350

Pro Gly Lys Ile Thr Asn Leu Tyr Leu Pro Ser Gly Gly Val Gly Leu
355 360 365

Arg Val Asp Ser Ala Val Tyr Pro Gly Tyr Thr Ile Pro Pro Tyr Tyr
370 375 380

Asp Ser Met Ile Ala Lys Ile Ile Val His Gly Glu Asn Arg Phe Asp
385 390 395 400

Ala Leu Met Lys Met Gln Arg Ala Leu Tyr Glu Leu Glu Ile Glu Gly
405 410 415

Val Gln Thr Asn Ala Asp Phe Gln Leu Asp Leu Ile Ser Asp Arg Asn
420 425 430

Val Ile Ala Gly Asp Tyr Asp Thr Cys Phe Leu Met Glu Thr Phe Leu
435 440 445

Pro Lys Tyr Gln Glu Lys Glu
450 455



<210> 167
<211> 77
<212> PRT

<213> Streptococcus pneumoniae
<400> 167

Met Ile Tyr Lys Val Phe Tyr Gln Glu Thr Lys Glu Arg Ser Pro Arg
1 5 10 15

Arg Glu Thr Thr Arg Ala Leu Tyr Leu Asp Ile Asp Thr Ser Ser Glu
20 25 30

Leu Glu Gly Arg Ile Thr Ala Arg Gln Leu Val Glu Glu Asn Arg Pro
35 40 45

Glu Tyr Asn Ile Glu Tyr Ile Glu Leu Leu Ser Asp Lys Leu Leu Asp
50 55 60

Tyr Glu Lys Glu Thr Gly Ala Phe Glu Ile Thr Glu Phe
65 70 75


<210> 168
<211> 336
<212> PRT

<213> Streptococcus pneumoniae
<400> 168

Met Lys Asp Arg Tyr Ile Leu Ala Phe Glu Thr Ser Cys Asp Glu Thr
1 5 10 15

Ser Val Ala Val Leu Lys Asn Asp Asp Glu Leu Leu Ser Asn Val Ile
20 25 30

Ala Ser Gln Ile Glu Ser His Lys Arg Phe Gly Gly Val Val Pro Glu
35 40 45

Val Ala Ser Arg His His Val Glu Val Ile Thr Ala Cys Ile Glu Glu
50 55 60

Ala Leu Ala Glu Ala Gly Ile Thr Glu Glu Asp Val Thr Ala Val Ala
65 70 75 80

Val Thr Tyr Gly Pro Gly Leu Val Gly Ala Leu Leu Val Gly Leu Ser
85 90 95

Ala Ala Lys Ala Phe Ala Trp Ala His Gly Leu Pro Leu Ile Pro Val
100 105 110

Asn His Met Ala Gly His Leu Met Ala Ala Gln Ser Val Glu Pro Leu
115 120 125

Glu Phe Pro Leu Leu Ala Leu Leu Val Ser Gly Gly His Thr Glu Leu
130 135 140

Val Tyr Val Ser Glu Ala Gly Asp Tyr Lys Ile Val Gly Glu Thr Arg
145 150 155 160

Asp Asp Ala Val Gly Glu Ala Tyr Asp Lys Val Gly Arg Val Met Gly
165 170 175

Leu Thr Tyr Pro Ala Gly Arg Glu Ile Asp Glu Leu Ala His Gln Gly
180 185 190

Gln Asp Ile Tyr Asp Phe Pro Arg Ala Met Ile Lys Glu Asp Asn Leu
195 200 205

Glu Phe Ser Phe Ser Gly Leu Lys Ser Ala Phe Ile Asn Leu His His
210 215 220

Asn Ala Glu Gln Lys Gly Glu Ser Leu Ser Thr Glu Asp Leu Cys Ala
225 230 235 240

Ser Phe Gln Ala Ala Val Met Asp Ile Leu Met Ala Lys Thr Lys Lys
245 250 255

Ala Leu Glu Glu Tyr Pro Val Lys Thr Leu Phe Val Ala Gly Gly Val
260 265 270

Ala Ala Asn Lys Gly Leu Arg Glu Arg Leu Ala Ala Glu Ile Thr Asp
275 280 285

Val Lys Val Ile Ile Pro Pro Leu Arg Leu Cys Gly Asp Asn Ala Gly
290 295 300

Met Ile Ala Tyr Ala Ser Val Ser Glu Trp Asn Lys Glu Asn Phe Ala
305 310 315 320

Gly Trp Asp Leu Asn Ala Lys Pro Ser Leu Ala Phe Asp Thr Met Glu
325 330 335



<210> 169
<211> 602
<212> PRT

<213> Streptococcus pneumoniae
<400> 169

Met Cys Gly Ile Val Gly Val Val Gly Asn Thr Asn Ala Thr Asp Ile
1 5 10 15

Leu Ile Gln Gly Leu Glu Lys Leu Glu Tyr Arg Gly Tyr Asp Ser Ala
20 25 30

Gly Ile Phe Val Leu Asp Gly Ala Asp Asn His Leu Val Lys Ala Val
35 40 45

Gly Arg Ile Ala Glu Leu Ser Ala Lys Thr Ala Gly Val Glu Gly Thr
50 55 60

Thr Gly Ile Gly His Thr Arg Trp Ala Thr His Gly Lys Pro Thr Glu
65 70 75 80

Asp Asn Ala His Pro His Arg Ser Glu Thr Glu Arg Phe Val Leu Val
85 90 95

His Asn Gly Val Ile Glu Asn Tyr Leu Glu Ile Lys Glu Glu Tyr Leu
100 105 110

Ala Gly His His Phe Lys Gly Gln Thr Asp Thr Glu Ile Ala Val His
115 120 125

Leu Ile Gly Lys Phe Ala Glu Glu Glu Gly Leu Ser Val Leu Glu Ala
130 135 140

Phe Lys Lys Ala Leu His Ile Ile Arg Gly Ser Tyr Ala Phe Ala Leu
145 150 155 160

Ile Asp Ser Glu Asn Pro Asp Val Ile Tyr Val Ala Lys Asn Lys Ser
165 170 175

Pro Leu Leu Ile Gly Leu Gly Glu Gly Tyr Asn Met Val Cys Ser Asp
180 185 190

Ala Met Ala Met Ile Arg Glu Thr Asn Gln Tyr Met Glu Ile His Asp
195 200 205

Gln Glu Leu Val Ile Val Lys Ala Asp Ser Val Glu Val Gln Asp Tyr
210 215 220

Asp Gly Asn Ser Arg Glu Arg Ala Ser Tyr Thr Ala Glu Leu Asp Leu
225 230 235 240

Ser Asp Ile Gly Lys Gly Thr Tyr Pro Tyr Tyr Met Leu Lys Glu Ile
245 250 255

Asp Glu Gln Pro Thr Val Met Arg Lys Leu Ile Gln Ala Tyr Thr Asp
260 265 270

Asp Ala Gly Gln Val Val Val Ala Pro Ala Ile Ile Lys Ala Val Gln
275 280 285

Asp Ala Asp Arg Ile Tyr Ile Leu Ala Ala Gly Thr Ser Tyr His Ala
290 295 300

Gly Phe Ala Ser Lys Lys Met Leu Glu Glu Leu Thr Asp Thr Pro Val
305 310 315 320

Glu Leu Gly Ile Ser Ser Glu Trp Gly Tyr Gly Met Pro Leu Leu Ser






325 330 335 

Lys Lys Pro Leu Phe Ile Phe Ile Ser Gln Ser Gly Glu Thr Ala Asp
340 345 350

Ser Arg Gln Val Leu Val Lys Ala Asn Glu Met Gly Ile Pro Ser Leu
355 360 365

Thr Val Thr Asn Val Pro Gly Ser Thr Leu Ser Arg Glu Ala Asn Tyr
370 375 380

Thr Met Leu Leu His Ala Gly Pro Glu Ile Ala Val Ala Ser Thr Lys
385 390 395 400

Ala Tyr Thr Ala Gln Ile Ala Ala Leu Ala Phe Leu Ala Lys Ala Val
405 410 415

Gly Glu Ala Asn Gly Asn Ala Lys Ala Gln Ala Phe Asp Leu Val His
420 425 430

Glu Leu Ser Ile Val Ala Gln Ser Ile Glu Ser Thr Leu Ser Glu Lys
435 440 445

Glu Thr Ile Glu Ala Lys Val Arg Glu Leu Leu Glu Thr Thr Arg Asn
450 455 460

Ala Phe Tyr Ile Gly Arg Gly Gln Asp Tyr Tyr Val Ala Met Glu Ala
465 470 475 480

Ser Leu Lys Leu Lys Glu Ile Ser Tyr Ile Gln Cys Glu Gly Phe Ala
485 490 495

Ala Gly Glu Leu Lys His Gly Thr Ile Ala Leu Ile Glu Glu Gly Thr
500 505 510

Pro Val Leu Ala Leu Leu Ser Asp Pro Val Leu Ala Asn His Thr Arg
515 520 525

Gly Asn Ile Gln Glu Val Ala Ala Arg Gly Ala Lys Val Leu Thr Ile
530 535 540

Ala Glu Glu Asn Val Ala Lys Asp Thr Asp Asp Ile Val Leu Thr Thr
545 550 555 560

Val His Pro Tyr Leu Ser Pro Ile Ser Met Val Val Pro Thr Gln Leu
565 570 575

Val Ala Tyr Phe Ala Thr Leu His Arg Gly Leu Asp Val Asp Lys Pro
580 585 590

Arg Asn Leu Ala Lys Ser Val Thr Val Glu
595 600



<210> 170
<211> 240
<212> PRT

<213> Streptococcus pneumoniae
<400> 170

Met Ile Arg Ile Glu Asn Leu Ser Val Ser Tyr Lys Glu Thr Leu Ala
1 5 10 15

Leu Lys Asp Ile Ser Leu Val Leu His Gly Pro Thr Ile Thr Gly Ile
20 25 30

Ile Gly Pro Asn Gly Ala Gly Lys Ser Thr Leu Leu Lys Gly Met Leu
35 40 45

Gly Ile Ile Pro His Gln Gly Gln Ala Phe Leu Asp Asp Lys Glu Val
50 55 60

Lys Lys Ser Leu His Arg Ile Ala Tyr Val Glu Gln Lys Ile Asn Ile
65 70 75 80

Asp Tyr Asn Phe Pro Ile Lys Val Lys Glu Cys Val Ser Leu Gly Leu
85 90 95

Phe Pro Ser Ile Pro Leu Phe Arg Ser Leu Lys Ala Lys His Trp Lys
100 105 110

Lys Val Gln Glu Ala Leu Glu Ile Val Gly Leu Ala Asp Tyr Ala Glu
115 120 125

Arg Gln Ile Ser Gln Leu Ser Gly Gly Gln Phe Gln Arg Val Leu Ile
130 135 140

Ala Arg Cys Leu Val Gln Glu Ala Asp Tyr Ile Leu Leu Asp Glu Pro
145 150 155 160

Phe Ala Gly Ile Asp Ser Val Ser Glu Glu Ile Ile Met Asn Thr Leu
165 170 175

Arg Asp Leu Lys Lys Ala Gly Lys Thr Val Leu Ile Val His His Asp
180 185 190

Leu Ser Lys Ile Pro His Tyr Phe Asp Gln Val Leu Leu Val Asn Arg
195 200 205

Glu Val Ile Ala Phe Gly Pro Thr Lys Glu Thr Phe Thr Glu Thr Asn
210 215 220

Leu Lys Glu Ala Tyr Gly Asn Gln Leu Phe Phe Asn Gly Gly Asp Leu
225 230 235 240



<210> 171
<211> 740
<212> PRT

<213> Streptococcus pneumoniae
<400> 171

Met Pro Lys Glu Val Asn Leu Thr Gly Glu Glu Val Val Ala Leu Thr
1 5 10 15

Lys Glu Tyr Leu Thr Glu Glu Asp Val His Phe Val His Lys Ala Leu
20 25 30

Val Tyr Ala Val Glu Cys His Ser Gly Gln Tyr Arg Lys Ser Gly Glu
35 40 45

Pro Tyr Ile Ile His Pro Ile Gln Val Ala Gly Ile Leu Ala Lys Leu
50 55 60

Lys Leu Asp Ala Val Thr Val Ala Cys Gly Phe Leu His Asp Val Val
65 70 75 80

Glu Asp Thr Asp Ala Thr Leu Asp Asp Leu Glu Arg Glu Phe Gly Pro
85 90 95

Asp Val Arg Val Ile Val Asp Gly Val Thr Lys Leu Gly Lys Val Glu
100 105 110

Tyr Lys Ser Ile Glu Glu Gln Leu Ala Glu Asn His Arg Lys Met Leu
115 120 125

Met Ala Met Ser Glu Asp Ile Arg Val Ile Leu Val Lys Leu Ser Asp
130 135 140

Arg Leu His Asn Met Arg Thr Leu Lys His Leu Arg Lys Asp Lys Gln
145 150 155 160

Glu Arg Ile Ser Lys Glu Thr Met Glu Ile Tyr Ala Pro Leu Ala His
165 170 175

Arg Leu Gly Ile Ser Ser Val Lys Trp Glu Leu Glu Asp Leu Ser Phe
180 185 190

Arg Tyr Leu Asn Pro Thr Glu Phe Tyr Lys Ile Thr His Met Met Lys
195 200 205

Glu Lys Arg Arg Glu Arg Glu Ala Leu Val Asp Glu Val Val Thr Lys
210 215 220

Leu Glu Glu Tyr Thr Thr Glu Arg His Leu Lys Gly Lys Ile Tyr Gly
225 230 235 240

Arg Pro Lys His Ile Tyr Ser Ile Phe Arg Lys Met Gln Asp Lys Arg
245 250 255

Lys Arg Phe Glu Glu Ile Tyr Asp Leu Ile Ala Ile Arg Cys Ile Leu
260 265 270

Asp Thr Gln Ser Asp Val Tyr Ala Met Leu Gly Tyr Val His Glu Phe
275 280 285

Trp Lys Pro Met Pro Gly Arg Phe Lys Asp Tyr Ile Ala Asn Arg Lys
290 295 300

Ala Asn Gly Tyr Gln Ser Ile His Thr Thr Val Tyr Gly Pro Lys Gly
305 310 315 320

Pro Ile Glu Phe Gln Ile Arg Thr Lys Glu Met His Glu Val Ala Glu
325 330 335

Tyr Gly Val Ala Ala His Trp Ala Tyr Lys Lys Gly Ile Lys Gly Gln
340 345 350

Val Asn Ser Lys Glu Ser Ala Ile Gly Met Asn Trp Ile Lys Glu Met
355 360 365

Met Glu Leu Gln Asp Gln Ala Asp Asp Ala Lys Glu Phe Val Asp Ser
370 375 380

Val Lys Glu Asn Tyr Leu Ala Glu Glu Ile Tyr Val Phe Thr Pro Asp
385 390 395 400

Gly Ala Val Arg Ser Leu Pro Lys Asp Ser Gly Pro Ile Asp Phe Ala
405 410 415

Tyr Glu Ile His Thr Lys Val Gly Glu Lys Ala Thr Gly Ala Lys Val
420 425 430

Asn Gly Arg Met Val Pro Leu Thr Thr Lys Leu Lys Thr Gly Asp Gln
435 440 445

Val Glu Ile Ile Ala Asn Pro Asn Ser Phe Gly Pro Ser Arg Asp Trp
450 455 460

Leu Asn Met Val Lys Thr Ser Lys Ala Arg Asn Lys Ile Arg Gln Phe
465 470 475 480

Phe Lys Asn Gln Asp Lys Glu Leu Ser Val Asn Lys Gly Arg Glu Met
485 490 495

Leu Met Ala Gln Phe Gln Glu Asn Gly Tyr Val Ala Asn Lys Phe Met
500 505 510

Asp Lys Arg His Met Asp Gln Val Leu Gln Lys Thr Ser Tyr Lys Thr
515 520 525

Glu Asp Ser Leu Phe Ala Ala Ile Gly Phe Gly Glu Ile Gly Ala Ile
530 535 540

Thr Val Phe Asn Arg Leu Thr Glu Lys Glu Arg Arg Glu Glu Glu Arg
545 550 555 560

Ala Lys Ala Lys Ala Glu Ala Glu Glu Leu Val Lys Gly Gly Glu Val
565 570 575

Lys Val Glu Asn Lys Glu Thr Leu Lys Val Lys His Glu Gly Gly Val
580 585 590

Val Ile Glu Gly Ala Ser Gly Leu Leu Val Arg Ile Ala Lys Cys Cys
595 600 605

Asn Pro Val Pro Gly Asp Asp Ile Val Gly Tyr Ile Thr Lys Gly Arg
610 615 620

Gly Val Ala Ile His Arg Val Asp Cys Met Asn Leu Arg Ala Gln Glu
625 630 635 640

Asn Tyr Glu Gln Arg Leu Leu Asp Val Glu Trp Glu Asp Gln Tyr Ser
645 650 655

Ser Ser Asn Lys Glu Tyr Leu Ala His Ile Asp Ile Tyr Gly Leu Asn
660 665 670

Arg Thr Gly Leu Leu Asn Asp Val Leu Gln Val Leu Ser Asn Thr Thr
675 680 685

Lys Asn Ile Ser Thr Val Asn Ala Gln Pro Thr Lys Asp Met Lys Phe
690 695 700

Ala Asn Ile His Val Ser Phe Gly Ile Ala Asn Leu Ser Thr Leu Thr
705 710 715 720

Thr Val Val Asp Lys Ile Lys Ser Val Pro Glu Val Tyr Ser Val Lys
725 730 735

Arg Thr Asn Gly
740



<210> 172
<211> 492
<212> PRT

<213> Streptococcus pneumoniae
<400> 172

Met Ser Asn Trp Asp Thr Lys Phe Leu Lys Lys Gly Phe Thr Phe Asp
1 5 10 15

Asp Val Leu Leu Ile Pro Ala Glu Ser His Val Leu Pro Asn Asp Ala
20 25 30

Asp Leu Thr Thr Lys Leu Ala Asp Asn Leu Thr Leu Asn Ile Pro Ile
35 40 45

Ile Thr Ala Ala Met Asp Thr Val Thr Glu Ser Gln Met Ala Ile Ala
50 55 60

Ile Ala Arg Ala Gly Gly Leu Gly Val Ile His Lys Asn Met Ser Ile
65 70 75 80

Ala Gln Gln Ala Asp Glu Val Arg Lys Val Lys Arg Ser Glu Asn Gly
85 90 95

Val Ile Ile Asp Pro Phe Phe Leu Thr Pro Glu His Thr Ile Ala Glu
100 105 110

Ala Asp Glu Leu Met Gly Arg Tyr Arg Ile Ser Gly Val Pro Val Val
115 120 125

Glu Thr Leu Glu Asn Arg Lys Leu Val Gly Ile Leu Thr Asn Arg Asp
130 135 140

Leu Arg Phe Ile Ser Asp Tyr Asn Gln Pro Ile Ser Asn His Met Thr
145 150 155 160

Ser Glu Asn Leu Val Thr Ala Pro Val Gly Thr Asp Leu Ala Thr Ala
165 170 175

Glu Ser Ile Leu Gln Glu His Arg Ile Glu Lys Leu Pro Leu Val Asp
180 185 190

Glu Glu Gly Ser Leu Ser Gly Leu Ile Thr Ile Lys Asp Ile Glu Lys
195 200 205

Val Ile Glu Phe Pro Asn Ala Ala Lys Asp Glu Phe Gly Arg Leu Leu
210 215 220

Val Ala Gly Ala Val Gly Val Thr Ser Asp Thr Phe Glu Arg Ala Glu
225 230 235 240

Ala Leu Phe Glu Ala Gly Ala Asp Ala Ile Val Ile Asp Thr Ala His
245 250 255

Gly His Ser Ala Gly Val Leu Arg Lys Ile Ala Glu Ile Arg Ala His
260 265 270

Phe Pro Asp Arg Thr Leu Ile Ala Gly Asn Ile Ala Thr Ala Glu Gly
275 280 285

Ala Arg Ala Leu Tyr Glu Ala Gly Val Asp Val Val Lys Val Gly Ile
290 295 300

Gly Pro Gly Ser Ile Cys Thr Thr Arg Val Ile Ala Gly Val Gly Val
305 310 315 320

Pro Gln Val Thr Ala Ile Tyr Asp Ala Ala Ala Val Ala Arg Glu Tyr
325 330 335

Gly Lys Thr Ile Ile Ala Asp Gly Gly Ile Lys Tyr Ser Gly Asp Ile
340 345 350

Val Lys Ala Leu Ala Ala Gly Gly Asn Ala Val Met Leu Gly Ser Met
355 360 365

Phe Ala Gly Thr Asp Glu Ala Pro Gly Glu Thr Glu Ile Phe Gln Gly
370 375 380

Arg Lys Phe Lys Thr Tyr Arg Gly Met Gly Ser Ile Ala Ala Met Lys
385 390 395 400

Lys Gly Ser Ser Asp Arg Tyr Phe Gln Gly Ser Val Asn Glu Ala Asn
405 410 415

Lys Leu Val Pro Glu Gly Ile Glu Gly Arg Val Ala Tyr Lys Gly Ala
420 425 430

Ala Ala Asp Ile Val Phe Gln Met Ile Gly Gly Ile Arg Ser Gly Met
435 440 445

Gly Tyr Cys Gly Ala Ala Asn Leu Lys Glu Leu His Asp Asn Ala Gln
450 455 460

Phe Ile Glu Met Ser Gly Ala Gly Leu Lys Glu Ser His Pro His Asp
465 470 475 480

Val Gln Ile Thr Asn Glu Ala Pro Asn Tyr Ser Met
485 490



<210> 173
<211> 648
<212> PRT

<213> Streptococcus pneumoniae
<400> 173

Met Thr Glu Glu Ile Lys Asn Leu Gln Ala Gln Asp Tyr Asp Ala Ser
1 5 10 15

Gln Ile Gln Val Leu Glu Gly Leu Glu Ala Val Arg Met Arg Pro Gly
20 25 30

Met Tyr Ile Gly Ser Thr Ser Lys Glu Gly Leu His His Leu Val Trp
35 40 45

Glu Ile Val Asp Asn Ser Ile Asp Glu Ala Leu Ala Gly Phe Ala Ser
50 55 60

His Ile Gln Val Phe Ile Glu Pro Asp Asp Ser Ile Thr Val Val Asp
65 70 75 80

Asp Gly Arg Gly Ile Pro Val Asp Ile Gln Glu Lys Thr Gly Arg Pro
85 90 95

Ala Val Glu Thr Val Phe Thr Val Leu His Ala Gly Gly Lys Phe Gly
100 105 110

Gly Gly Gly Tyr Lys Val Ser Gly Gly Leu His Gly Val Gly Ser Ser
115 120 125

Val Val Asn Ala Leu Ser Thr Gln Leu Asp Val His Val His Lys Asn
130 135 140

Gly Lys Ile His Tyr Gln Glu Tyr Arg Arg Gly His Val Val Ala Asp
145 150 155 160

Leu Glu Ile Val Gly Asp Thr Asp Lys Thr Gly Thr Thr Val His Phe
165 170 175

Thr Pro Asp Pro Lys Ile Phe Thr Glu Thr Thr Ile Phe Asp Phe Asp
180 185 190

Lys Leu Asn Lys Arg Ile Gln Glu Leu Ala Phe Leu Asn Arg Gly Leu
195 200 205

Gln Ile Ser Ile Thr Asp Lys Arg Gln Gly Leu Glu Gln Thr Lys His
210 215 220

Tyr His Tyr Glu Gly Gly Ile Ala Ser Tyr Val Glu Tyr Ile Asn Glu
225 230 235 240

Asn Lys Asp Val Ile Phe Asp Thr Pro Ile Tyr Thr Asp Gly Glu Met
245 250 255

Asp Asp Ile Thr Val Glu Val Ala Met Gln Tyr Thr Thr Gly Tyr His
260 265 270

Glu Asn Val Met Ser Phe Ala Asn Asn Ile His Thr His Glu Gly Gly
275 280 285

Thr His Glu Gln Gly Phe Arg Thr Ala Leu Thr Arg Val Ile Asn Asp
290 295 300

Tyr Ala Arg Lys Asn Lys Leu Leu Lys Asp Asn Glu Asp Asn Leu Thr
305 310 315 320

Gly Glu Asp Val Arg Glu Gly Leu Thr Ala Val Ile Ser Val Lys His
325 330 335

Pro Asn Pro Gln Phe Glu Gly Gln Thr Lys Thr Lys Leu Gly Asn Ser
340 345 350

Glu Val Val Lys Ile Thr Asn Arg Leu Phe Ser Glu Ala Phe Ser Asp
355 360 365

Phe Leu Met Glu Asn Pro Gln Ile Ala Lys Arg Ile Val Glu Lys Gly
370 375 380

Ile Leu Ala Ala Lys Ala Arg Val Ala Ala Lys Arg Ala Arg Glu Val
385 390 395 400

Thr Arg Lys Lys Ser Gly Leu Glu Ile Ser Asn Leu Pro Gly Lys Leu
405 410 415

Ala Asp Cys Ser Ser Asn Asn Pro Ala Glu Thr Glu Leu Phe Ile Val
420 425 430

Glu Gly Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asn Arg Glu
435 440 445

Phe Gln Ala Ile Leu Pro Ile Arg Gly Lys Ile Leu Asn Val Glu Lys
450 455 460

Ala Ser Met Asp Lys Ile Leu Ala Asn Glu Glu Ile Arg Ser Leu Phe
465 470 475 480

Thr Ala Met Gly Thr Gly Phe Gly Ala Glu Phe Asp Val Ser Lys Ala
485 490 495

Arg Tyr Gln Lys Leu Val Leu Met Thr Asp Ala Asp Val Asp Gly Ala
500 505 510

His Ile Arg Thr Leu Leu Leu Thr Leu Ile Tyr Arg Tyr Met Lys Pro
515 520 525

Ile Leu Glu Ala Gly Tyr Val Tyr Ile Ala Gln Pro Pro Ile Tyr Gly
530 535 540

Val Lys Val Gly Ser Glu Ile Lys Glu Tyr Ile Gln Pro Gly Ala Asp
545 550 555 560

Gln Glu Ile Lys Leu Gln Glu Ala Leu Ala Arg Tyr Ser Glu Gly Arg
565 570 575

Thr Lys Pro Thr Ile Gln Arg Tyr Lys Gly Leu Gly Glu Met Asp Asp
580 585 590

His Gln Leu Trp Glu Thr Thr Met Asp Pro Glu His Arg Leu Met Ala
595 600 605

Arg Val Ser Val Asp Asp Ala Ala Glu Ala Asp Lys Ile Phe Asp Met
610 615 620

Leu Met Gly Asp Arg Val Glu Pro Arg Arg Glu Phe Ile Glu Glu Asn
625 630 635 640

Ala Val Tyr Ser Thr Leu Asp Val
645


<210> 174
<211> 88
<212> PRT

<213> Streptococcus pneumoniae
<400> 174

Met Gly Phe Thr Glu Glu Thr Val Arg Phe Lys Leu Asp Asp Ser Asn
1 5 10 15

Lys Lys Glu Ile Ser Glu Thr Leu Thr Asp Val Tyr Ala Ser Leu Asn
20 25 30

Asp Lys Gly Tyr Asn Pro Ile Asn Gln Ile Val Gly Tyr Val Leu Ser
35 40 45

Gly Asp Pro Ala Tyr Val Pro Arg Tyr Asn Asn Ala Arg Asn Gln Ile
50 55 60

Arg Lys Tyr Glu Arg Asp Glu Ile Val Glu Glu Leu Val Arg Tyr Tyr
65 70 75 80

Leu Lys Gly Gln Gly Val Asp Leu
85


<210> 175
<211> 198
<212> PRT

<213> Streptococcus pneumoniae
<400> 175

Met Val Asn Tyr Pro His Lys Val Ser Ser Gln Asp Arg Gln Thr Ser
1 5 10 15

Leu Ser Gln Pro Lys Asn Phe Ala Asn Arg Gly Met Ser Phe Glu Lys
20 25 30

Met Ile Asn Ala Thr Asn Asp Tyr Tyr Leu Ser Gln Gly Leu Ala Val
35 40 45

Ile His Lys Lys Pro Thr Pro Ile Gln Ile Val Gln Val Asp Tyr Pro
50 55 60

Gln Arg Ser Arg Ala Lys Ile Val Glu Ala Tyr Phe Arg Gln Ala Ser
65 70 75 80

Thr Thr Asp Tyr Ser Gly Val Tyr Asn Gly Tyr Tyr Ile Asp Phe Glu
85 90 95

Val Lys Glu Thr Lys Gln Lys Arg Ala Ile Pro Met Lys Asn Phe His
100 105 110

Pro His Gln Ile Gln His Met Glu Gln Val Leu Ala Gln Gln Gly Ile
115 120 125

Cys Phe Val Leu Leu His Phe Ser Ser Gln Gln Glu Thr Tyr Leu Leu
130 135 140

Pro Ala Phe Asp Leu Ile Arg Phe Tyr His Gln Asp Lys Gly Gln Lys
145 150 155 160

Ser Met Pro Leu Glu Tyr Ile Arg Glu Tyr Gly Tyr Glu Ile Lys Ala
165 170 175

Gly Ala Phe Pro Gln Ile Pro Tyr Leu Asn Val Ile Lys Glu His Leu
180 185 190

Leu Gly Gly Lys Thr Arg
195



<210> 176
<211> 288
<212> PRT

<213> Streptococcus pneumoniae
<400> 176

Met Ala Leu Phe Ser Lys Lys Asp Lys Tyr Ile Arg Ile Asn Pro Asn
1 5 10 15

Arg Ser Val Arg Glu Lys Pro Gln Ala Lys Pro Glu Val Pro Asp Glu
20 25 30

Leu Phe Ser Gln Cys Pro Gly Cys Lys His Thr Ile Tyr Gln Lys Asp
35 40 45

Leu Gly Ser Glu Arg Ile Cys Pro His Cys Ser Tyr Thr Phe Arg Ile
50 55 60

Ser Ala Gln Glu Arg Leu Ala Leu Thr Ile Asp Met Gly Thr Phe Lys
65 70 75 80

Glu Leu Phe Thr Gly Ile Glu Ser Lys Asp Pro Leu His Phe Pro Gly
85 90 95

Tyr Gln Lys Lys Leu Ala Ser Met Arg Glu Lys Thr Gly Leu His Glu
100 105 110

Ala Val Val Thr Gly Thr Ala Leu Ile Lys Gly Gln Thr Val Ala Leu
115 120 125

Gly Ile Met Asp Ser Asn Phe Ile Met Ala Ser Met Gly Thr Val Val
130 135 140

Gly Glu Lys Ile Thr Arg Leu Phe Glu Tyr Ala Thr Val Glu Lys Leu
145 150 155 160

Pro Val Val Leu Phe Thr Ala Ser Gly Gly Ala Arg Met Gln Glu Gly
165 170 175

Ile Met Ser Leu Met Gln Met Ala Lys Ile Ser Ala Ala Val Lys Arg
180 185 190

His Ser Asn Ala Gly Leu Phe Tyr Leu Thr Ile Leu Thr Asp Pro Thr
195 200 205

Thr Gly Gly Val Thr Ala Ser Phe Ala Met Glu Gly Asp Ile Ile Leu
210 215 220

Ala Glu Pro Gln Ser Leu Val Gly Phe Ala Gly Arg Arg Val Ile Glu
225 230 235 240

Asn Thr Val Arg Glu Ser Leu Pro Glu Asp Phe Gln Lys Ala Glu Phe
245 250 255

Leu Leu Glu His Gly Phe Val Asp Ala Ile Val Lys Arg Arg Asp Leu
260 265 270

Pro Asp Thr Ile Ala Ser Leu Val Arg Leu His Gly Gly Ser Pro Arg
275 280 285



<210> 177
<211> 139
<212> PRT

<213> Streptococcus pneumoniae
<400> 177

Met Arg Ile Met Gly Leu Asp Val Gly Ser Lys Thr Val Gly Val Ala
1 5 10 15

Ile Ser Asp Pro Leu Gly Phe Thr Ala Gln Gly Leu Glu Ile Ile Gln
20 25 30

Ile Asn Glu Glu Gln Gly Gln Phe Gly Ser Asp Arg Val Lys Glu Leu
35 40 45

Val Asp Thr Tyr Lys Val Glu Arg Phe Val Val Gly Leu Pro Lys Asn
50 55 60

Met Asn Asn Thr Ser Gly Pro Arg Val Glu Ala Ser Gln Ala Tyr Gly
65 70 75 80

Ala Lys Leu Glu Glu Phe Phe Gly Leu Pro Val Asp Tyr Gln Asp Glu
85 90 95

Arg Leu Thr Thr Val Ala Ala Glu Arg Met Leu Ile Glu Gln Ala Asp
100 105 110

Ile Ser Arg Asn Lys Arg Lys Lys Val Ile Asp Lys Leu Ala Ala Gln
115 120 125

Leu Ile Leu Gln Asn Tyr Leu Asp Arg Lys Phe
130 135



<210> 178
<211> 398
<212> PRT

<213> Streptococcus pneumoniae
<400> 178

Met Ala Lys Leu Thr Val Lys Asp Val Asp Leu Lys Gly Lys Lys Val
1 5 10 15

Leu Val Arg Val Asp Phe Asn Val Pro Leu Lys Asp Gly Val Ile Thr
20 25 30

Asn Asp Asn Arg Ile Thr Ala Ala Leu Pro Thr Ile Lys Tyr Ile Ile
35 40 45

Glu Gln Gly Gly Arg Ala Ile Leu Phe Ser His Leu Gly Arg Val Lys
50 55 60

Glu Glu Ala Asp Lys Ala Gly Lys Ser Leu Ala Pro Val Ala Ala Asp
65 70 75 80

Leu Ala Ala Lys Leu Gly Gln Asp Val Val Phe Pro Gly Val Thr Arg
85 90 95

Gly Ala Glu Leu Glu Ala Ala Ile Asn Ala Leu Glu Asp Gly Gln Val
100 105 110

Leu Leu Val Glu Asn Thr Arg Tyr Glu Asp Val Asp Gly Lys Lys Glu
115 120 125

Ser Lys Asn Asp Pro Glu Leu Gly Lys Tyr Trp Ala Ser Leu Gly Asp
130 135 140

Gly Ile Phe Val Asn Asp Ala Phe Gly Thr Ala His Arg Ala His Ala
145 150 155 160

Ser Asn Val Gly Ile Ser Ala Asn Val Glu Lys Ala Val Ala Gly Phe
165 170 175

Leu Leu Glu Asn Glu Ile Ala Tyr Ile Gln Glu Ala Val Glu Thr Pro
180 185 190

Glu Arg Pro Phe Val Ala Ile Leu Gly Gly Ser Lys Val Ser Asp Lys
195 200 205

Ile Gly Val Ile Glu Asn Leu Leu Glu Lys Ala Asp Lys Val Leu Ile
210 215 220

Gly Gly Gly Met Thr Tyr Thr Phe Tyr Lys Ala Gln Gly Ile Glu Ile
225 230 235 240

Gly Asn Ser Leu Val Glu Glu Asp Lys Leu Asp Val Ala Lys Ala Leu
245 250 255

Leu Glu Lys Ala Asn Gly Lys Leu Ile Leu Pro Val Asp Ser Lys Glu
260 265 270

Ala Asn Ala Phe Ala Gly Tyr Thr Glu Val Arg Asp Thr Glu Gly Glu
275 280 285

Ala Val Ser Glu Gly Phe Leu Gly Leu Asp Ile Gly Pro Lys Ser Ile
290 295 300

Ala Lys Phe Asp Glu Ala Leu Thr Gly Ala Lys Thr Val Val Trp Asn
305 310 315 320

Gly Pro Met Gly Val Phe Glu Asn Pro Asp Phe Gln Ala Gly Thr Ile
325 330 335

Gly Val Met Asp Ala Ile Val Lys Gln Pro Gly Val Lys Ser Ile Ile
340 345 350

Gly Gly Gly Asp Ser Ala Ala Ala Ala Ile Asn Leu Gly Arg Ala Asp
355 360 365

Lys Phe Ser Trp Ile Ser Thr Gly Gly Gly Ala Ser Met Glu Leu Leu
370 375 380

Glu Gly Lys Val Leu Pro Gln Leu Ala Ala Leu Thr Glu Lys
385 390 395



<210> 179
<211> 165
<212> PRT

<213> Streptococcus pneumoniae
<400> 179

Met Leu Lys Ser Glu Lys Gln Ser Arg Tyr Gln Met Leu Asn Glu Glu
1 5 10 15

Leu Ser Phe Leu Leu Glu Gly Glu Thr Asn Val Leu Ala Asn Leu Ser
20 25 30

Asn Ala Ser Ala Leu Ile Lys Ser Arg Phe Pro Asn Thr Val Phe Ala
35 40 45

Gly Phe Tyr Leu Phe Asp Gly Lys Glu Leu Val Leu Gly Pro Phe Gln
50 55 60

Gly Gly Val Ser Cys Ile Arg Ile Ala Leu Gly Lys Gly Val Cys Gly
65 70 75 80

Glu Ala Ala His Phe Gln Glu Thr Val Ile Val Gly Asp Val Thr Thr
85 90 95

Tyr Leu Asn Tyr Ile Ser Cys Asp Ser Leu Ala Lys Ser Glu Ile Val
100 105 110

Val Pro Met Met Lys Asn Gly Gln Leu Leu Gly Val Leu Asp Leu Asp
115 120 125

Ser Ser Glu Ile Glu Asp Tyr Asp Ala Met Asp Arg Asp Tyr Leu Glu
130 135 140

Gln Phe Val Ala Ile Leu Leu Glu Lys Thr Ala Trp Asp Phe Thr Met
145 150 155 160

Phe Glu Glu Lys Ser
165



<210> 180
<211> 209
<212> PRT

<213> Streptococcus pneumoniae
<400> 180

Met Thr Ile Glu Leu Leu Thr Pro Phe Thr Lys Val Glu Leu Glu Pro
1 5 10 15

Glu Ile Lys Glu Lys Lys Arg Lys Gln Val Gly Ile Leu Gly Gly Asn
20 25 30

Phe Asn Pro Val His Asn Ala His Leu Ile Val Ala Asp Gln Val Arg
35 40 45

Gln Gln Leu Gly Leu Asp Gln Val Leu Leu Met Pro Glu Tyr Gln Pro
50 55 60

Pro His Val Asp Lys Lys Glu Thr Ile Pro Glu His His Arg Leu Lys
65 70 75 80

Met Leu Glu Leu Ala Ile Glu Gly Ile Asp Gly Leu Val Ile Glu Thr
85 90 95

Ile Glu Leu Glu Arg Lys Gly Ile Ser Tyr Thr Tyr Asp Thr Met Lys
100 105 110

Ile Leu Thr Glu Lys Asn Pro Asp Thr Asp Tyr Tyr Phe Ile Ile Gly
115 120 125

Ala Asp Met Val Asp Tyr Leu Pro Lys Trp Tyr Arg Ile Asp Glu Leu
130 135 140

Val Asp Met Val Gln Phe Val Gly Val Gln Arg Pro Arg Tyr Lys Val
145 150 155 160

Gly Thr Ser Tyr Pro Val Ile Trp Val Asp Val Pro Leu Met Asp Ile
165 170 175

Ser Ser Ser Met Val Arg Ala Phe Leu Ala Gln Gly Arg Lys Pro Asn
180 185 190

Phe Leu Leu Pro Gln Pro Val Leu Asp Tyr Ile Glu Lys Glu Gly Leu
195 200 205

Tyr



<210> 181
<211> 255
<212> PRT

<213> Streptococcus pneumoniae
<400> 181

Met Asn Ile Ala Lys Ile Val Arg Glu Ala Arg Glu Gln Ser Arg Leu
1 5 10 15

Thr Thr Leu Asp Phe Ala Thr Gly Ile Phe Asp Glu Phe Ile Gln Leu
20 25 30

His Gly Asp Arg Ser Phe Arg Asp Asp Gly Ala Val Val Gly Gly Ile
35 40 45

Gly Trp Leu Gly Asp Gln Ala Val Thr Val Val Gly Ile Gln Lys Gly
50 55 60

Lys Ser Leu Gln Asp Asn Leu Lys Arg Asn Phe Gly Gln Pro His Pro
65 70 75 80

Glu Gly Tyr Arg Lys Ala Leu Arg Leu Met Lys Gln Ala Glu Lys Phe
85 90 95

Gly Arg Pro Val Val Thr Phe Ile Asn Thr Ala Gly Ala Tyr Pro Gly
100 105 110

Val Gly Ala Glu Glu Arg Gly Gln Gly Glu Ala Ile Ala Arg Asn Leu
115 120 125

Met Glu Met Ser Asp Leu Lys Val Pro Ile Ile Ala Ile Ile Ile Gly
130 135 140

Glu Gly Gly Ser Gly Gly Ala Leu Ala Leu Ala Val Ala Asp Arg Val
145 150 155 160

Trp Met Leu Glu Asn Ser Ile Tyr Ala Ile Leu Ser Pro Glu Gly Phe
165 170 175

Ala Ser Ile Leu Trp Lys Asp Gly Thr Arg Ala Met Glu Ala Ala Glu
180 185 190

Leu Met Lys Ile Thr Ser His Glu Leu Leu Glu Met Asp Val Val Asp
195 200 205

Lys Val Ile Ser Glu Val Gly Leu Ser Ser Lys Glu Leu Ile Lys Ser
210 215 220

Val Lys Lys Glu Leu Gln Thr Glu Leu Ala Arg Leu Ser Gln Lys Pro
225 230 235 240

Leu Glu Glu Leu Leu Glu Glu Arg Tyr Gln Arg Phe Arg Lys Tyr
245 250 255



<210> 182
<211> 169
<212> PRT

<213> Streptococcus pneumoniae
<400> 182

Met Ile Ile Lys Val Glu Met Ala Asp Val Glu Val Leu Ala Lys Ile
1 5 10 15

Ala Lys Gln Thr Phe Arg Glu Thr Phe Ala Tyr Asp Asn Thr Glu Glu
20 25 30

Gln Leu Gln Glu Tyr Phe Glu Glu Ala Tyr Ser Leu Lys Thr Leu Ser
35 40 45

Thr Glu Leu Gly Asn Pro Asp Ser Glu Thr Tyr Phe Ile Met His Glu
50 55 60

Glu Glu Ile Ala Gly Phe Leu Lys Val Asn Trp Gly Ser Ala Gln Thr
65 70 75 80

Glu Arg Glu Leu Glu Asp Ala Phe Glu Ile Gln Arg Leu Tyr Val Leu
85 90 95

Gln Lys Phe Gln Gly Phe Gly Leu Gly Lys Gln Leu Phe Glu Phe Ala
100 105 110

Leu Glu Leu Ala Thr Lys Asn Ser Phe Ser Trp Ala Trp Leu Gly Val
115 120 125
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Trp Glu His Asn Thr Lys Ala Gin Ala Phe Tyr Asn Arg Tyr Gly Phe 

130 135 140 

Glu Lys Phe'Ser Gin His His Phe Met Val Gly Gin Lys Val Asp Thr 

5 145 . 150 155 160 



Asp Trp Leu Leu Arg Lys Lys' Leu Arg 
165 

10 

<210> 183 
<211> 529 
<212> PRT 

<213> Streptococcus pneumoniae 

<400> 183

Met Leu Arg Gly Thr Ala Leu Leu Thr Ala Ser Asn Phe Ile Ser Arg
1 5 10 15

Leu Leu Gly Ala Val Tyr Ile Ile Pro Trp Tyr Ile Trp Met Gly Ala
20 25 30

Tyr Ala Ala Lys Ala Asn Gly Leu Phe Thr Met Gly Tyr Thr Ile Tyr
35 40 45

Ala Trp Phe Leu Leu Val Ser Thr Ala Gly Ile Pro Val Ala Val Ala
50 55 60

Lys Gln Val Ala Lys Tyr Asn Thr Met Arg Glu Glu Glu His Ser Phe
65 70 75 80

Ala Leu Ile Arg Ser Phe Leu Gly Phe Met Thr Gly Leu Gly Leu Val
85 90 95

Phe Ala Leu Val Leu Tyr Val Phe Ala Pro Trp Leu Ala Asp Leu Ser
100 105 110

Gly Val Gly Lys Asp Leu Ile Pro Ile Met Gln Ser Leu Ala Trp Gly
115 120 125

Val Leu Ile Phe Pro Ser Met Ser Val Ile Arg Gly Phe Phe Gln Gly
130 135 140

Met Asn Asn Leu Lys Pro Tyr Ala Met Ser Gln Ile Ala Glu Gln Val
145 150 155 160



Ile Arg Val Ile Trp Met Leu Leu Ala Thr Phe Ile Ile Met Lys Leu
165 170 175

Gly Ser Gly Asp Tyr Leu Ala Ala Val Thr Gln Ser Thr Phe Ala Ala
180 185 190

Phe Val Gly Met Val Ala Ser Phe Ala Val Leu Ile Tyr Phe Leu Ala
195 200 205

Gln Glu Ser Ser Leu Lys Arg Val Phe Glu Thr Gly Asp Lys Ile Asn
210 215 220

Ser Lys Arg Leu Leu Val Asp Thr Ile Lys Glu Ala Ile Pro Phe Ile
225 230 235 240

Leu Thr Gly Ser Ala Ile Gln Ile Phe Gln Ile Leu Asp Gln Leu Thr
245 250 255

Phe Ile Asn Ser Met Ser Trp Phe Thr Asn Tyr Ser Asn Glu Asp Leu
260 265 270

Val Val Met Phe Ser Tyr Phe Ser Ala Asn Pro Asn Lys Ile Thr Met
275 280 285

Ile Leu Ile Ser Val Gly Val Ser Ile Gly Ser Val Gly Leu Pro Leu
290 295 300

Leu Thr Glu Asn Tyr Val Lys Gly Asp Leu Lys Ala Ala Ser Arg Leu
305 310 315 320

Val Gln Asp Ser Leu Thr Leu Leu Phe Met Phe Leu Leu Pro Ala Thr
325 330 335

Val Gly Val Val Met Val Gly Glu Pro Leu Tyr Thr Val Phe Tyr Gly
340 345 350

Lys Pro Asp Ser Leu Ala Leu Gly Leu Phe Val Phe Ala Val Leu Gln
355 360 365

Ser Ile Ile Leu Gly Leu Tyr Met Val Leu Ser Pro Met Leu Gln Ala
370 375 380

Met Phe Arg Asn Arg Lys Ala Val Leu Tyr Phe Ile Tyr Gly Ser Ile
385 390 395 400

Ala Lys Leu Val Leu Gln Leu Pro Thr Ile Ala Leu Phe His Ser Tyr
405 410 415

Gly Pro Leu Ile Ser Thr Thr Ile Ala Leu Ile Ile Pro Asn Val Leu
420 425 430

Met Tyr Arg Asp Ile Cys Lys Val Thr Gly Val Lys Arg Lys Val Ile
435 440 445

Leu Lys Arg Thr Ile Leu Ile Ser Leu Leu Thr Leu Val Met Phe Leu
450 455 460

Leu Ile Gly Thr Ile Gln Trp Leu Leu Gly Phe Phe Phe Gln Pro Ser
465 470 475 480

Gly Arg Leu Trp Ser Phe Phe Tyr Val Ala Leu Val Gly Ala Met Gly
485 490 495

Gly Gly Leu Tyr Met Val Met Ser Leu Arg Thr Tyr Leu Leu Asp Lys
500 505 510

Val Ile Gly Lys Ala Gln Ala Asp Arg Leu Arg Ala Lys Phe Lys Leu
515 520 525

Ser



<210> 184 
<211> 155 
<212> PRT

<213> Streptococcus pneumoniae

<400> 184

Met Ser Asp Lys Ile Gly Leu Phe Thr Gly Ser Phe Asp Pro Met Thr
1 5 10 15

Asn Gly His Leu Asp Ile Ile Glu Arg Ala Ser Arg Leu Phe Asp Lys
20 25 30

Leu Tyr Val Gly Ile Phe Phe Asn Pro His Lys Gln Gly Phe Leu Pro
35 40 45

Ile Glu Asn Arg Lys Arg Gly Leu Glu Lys Ala Leu Gly His Leu Glu
50 55 60

Asn Val Glu Val Val Ala Ser His Asp Glu Leu Val Val Asp Val Ala
65 70 75 80

Lys Arg Leu Gly Ala Thr Cys Leu Val Arg Gly Leu Arg Asn Ala Ser
85 90 95

Asp Leu Gln Tyr Glu Ala Ser Phe Asp Tyr Tyr Asn His Gln Leu Ser
100 105 110

Ser Asp Ile Glu Thr Ile Tyr Leu His Ser Arg Pro Glu His Leu Tyr
115 120 125

Ile Ser Ser Ser Gly Val Arg Glu Leu Leu Lys Phe Gly Gln Asp Ile
130 135 140

Ala Cys Tyr Val Pro Glu Ser Ile Trp Arg Lys
145 150 155



<210> 185 
<211> 143 
<212> PRT 

<213> Streptococcus pneumoniae 

<400> 185

Met Thr Ile Leu Phe Val Val Ile Ser Ala Ser Phe Leu Tyr Met Val
1 5 10 15

Ser Leu Ser Met Lys Pro Tyr Gln Thr Ala Lys Ser Glu Gly Glu Lys
20 25 30

Leu Ala Gln Gln Tyr Ala Gly Leu Glu Gln Ala Asp Gln Val Asp Leu
35 40 45

Tyr Asn Gly Leu Glu Ser Tyr Tyr Ser Val Leu Gly Arg Asn Lys Gln
50 55 60

Gln Glu Ala Leu Ala Val Leu Ile Gly Lys Asp Asp His Lys Ile Tyr
65 70 75 80

Val Tyr Gln Leu Asn Gln Gly Val Ser Gln Glu Lys Ala Glu Thr Val
85 90 95

Ser Lys Glu Lys Gly Ala Gly Glu Ile Asp Lys Ile Ile Phe Gly Arg
100 105 110

Tyr Gln Asp Lys Pro Ile Trp Glu Val Lys Ser Gly Ser Asp Phe Tyr
115 120 125

Leu Val Asp Phe Glu Thr Gly Ala Leu Val Asn Lys Glu Gly Leu
130 135 140



<210> 186
<211> 243
<212> PRT

<213> Streptococcus pneumoniae
<400> 186

Met Ile Asp Ile His Ser His Ile Val Phe Asp Val Asp Asp Gly Pro
1 5 10 15

Lys Ser Arg Glu Glu Ser Lys Ala Leu Leu Thr Glu Ala Tyr Arg Gln
20 25 30

Gly Val Arg Thr Ile Val Ser Thr Ser His Arg Arg Lys Gly Met Phe
35 40 45

Glu Thr Pro Glu Glu Lys Ile Ala Glu Asn Phe Leu Gln Val Arg Glu
50 55 60

Ile Ala Lys Glu Val Ala Ser Asp Leu Val Ile Ala Tyr Gly Ala Glu
65 70 75 80

Ile Tyr Tyr Thr Pro Asp Val Leu Asp Lys Leu Glu Asn Asn Arg Ile
85 90 95

Pro Thr Leu Asn Asn Ser Arg Tyr Ala Leu Ile Glu Phe Ser Met Asn
100 105 110

Thr Pro Tyr Arg Asp Ile His Ser Ala Leu Asn Lys Ile Leu Met Leu
115 120 125

Gly Ile Thr Pro Val Ile Ala His Ile Glu Arg Tyr Asp Val Leu Glu
130 135 140

Asn Asn Glu Lys Arg Val Arg Glu Leu Ile Asp Met Gly Cys Tyr Thr
145 150 155 160

Gln Ile Asn Ser Ser His Val Leu Lys Ser Lys Leu Phe Gly Glu Pro
165 170 175

Tyr Lys Phe Met Lys Lys Arg Ala Gln Tyr Phe Leu Glu Arg Asp Leu
180 185 190

Val His Ile Ile Ala Ser Asp Met His Asn Val Asp Gly Arg Pro Pro
195 200 205

His Met Ala Glu Ala Tyr Asp Leu Val Ser Gln Lys Tyr Gly Glu Ala
210 215 220

Lys Ala Gln Glu Leu Phe Ile Asp Asn Pro Arg Lys Ile Val Met Asp
225 230 235 240

Gln Leu Ile



<210> 187 
<211> 308 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 187 

Met Ser Thr Ile Asp Lys Glu Lys Phe Gln Phe Val Lys Arg Asp Asp
1 5 10 15

Phe Ala Ser Glu Thr Ile Asp Ala Pro Ala Tyr Ser Tyr Trp Lys Ser
20 25 30

Val Phe Lys Gln Phe Met Lys Lys Lys Ser Thr Val Val Met Leu Gly
35 40 45

Ile Leu Val Ala Ile Ile Leu Ile Ser Phe Ile Tyr Pro Met Phe Ser
50 55 60

Lys Phe Asp Phe Asn Asp Val Ser Lys Val Asn Asp Phe Ser Val Arg
65 70 75 80

Tyr Ile Lys Pro Asn Ala Glu His Trp Phe Gly Thr Asp Ser Asn Gly
85 90 95

Lys Ser Leu Phe Asp Gly Val Trp Phe Gly Ala Arg Asn Ser Ile Leu
100 105 110

Ile Ser Val Ile Ala Thr Val Ile Asn Leu Val Ile Gly Val Phe Val
115 120 125

Gly Gly Ile Trp Gly Ile Ser Lys Ser Val Asp Arg Val Met Met Glu
130 135 140

Val Tyr Asn Val Ile Ser Asn Ile Pro Pro Leu Leu Ile Val Ile Val
145 150 155 160

Leu Thr Tyr Ser Ile Gly Ala Gly Phe Trp Asn Leu Ile Phe Ala Met
165 170 175

Ser Val Thr Thr Trp Ile Gly Ile Ala Phe Met Ile Arg Val Gln Ile
180 185 190

Leu Arg Tyr Arg Asp Leu Glu Tyr Asn Leu Ala Ser Arg Thr Leu Gly
195 200 205

Thr Pro Thr Leu Lys Ile Val Ala Lys Asn Ile Met Pro Gln Leu Val
210 215 220

Ser Val Ile Val Thr Thr Met Thr Gln Met Leu Pro Ser Phe Ile Ser
225 230 235 240

Tyr Glu Ala Phe Leu Ser Phe Phe Gly Leu Gly Leu Pro Ile Thr Val
245 250 255

Pro Ser Leu Gly Arg Leu Ile Ser Asp Tyr Ser Gln Asn Val Thr Thr
260 265 270

Asn Ala Tyr Leu Phe Trp Ile Pro Leu Thr Thr Leu Val Leu Val Ser
275 280 285

Leu Ser Leu Phe Val Val Gly Gln Asn Leu Ala Asp Ala Ser Asp Pro
290 295 300

Arg Thr His Arg
305

WO 01/49721 PCT/US00/35604



<210> 188 

<211> 77 

<212> PRT 

<213> Streptococcus pneumoniae

<400> 188

Met Tyr Asn Leu Leu Leu Thr Ile Leu Leu Val Leu Ser Val Val Ile
1 5 10 15

Val Ile Ala Ile Phe Met Gln Pro Thr Lys Asn Gln Ser Ser Asn Val
20 25 30

Phe Asp Ala Ser Ser Gly Asp Leu Phe Glu Arg Ser Lys Ala Arg Gly
35 40 45

Phe Glu Ala Val Met Gln Arg Leu Thr Gly Ile Leu Val Phe Phe Trp
50 55 60

Leu Ala Ile Ala Leu Ala Leu Thr Val Leu Ser Ser Arg
65 70 75



<210> 189 
<211> 369 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 189 

Met Phe Arg Arg Asn Lys Leu Phe Phe Trp Thr Thr Glu Ile Leu Leu
1 5 10 15

Leu Thr Ile Ile Phe Tyr Leu Trp Arg Gln Met Gly Ser Leu Ile Asn
20 25 30

Pro Phe Val Ser Val Leu Asn Thr Ile Met Ile Pro Phe Leu Leu Gly
35 40 45

Gly Phe Leu Tyr Tyr Leu Thr Asn Pro Ile Val Thr Phe Leu Asn Lys
50 55 60

Val Cys Lys Leu Asn Arg Leu Leu Gly Ile Leu Ile Thr Leu Cys Thr
65 70 75 80

Leu Val Trp Gly Met Val Ile Gly Val Val Tyr Leu Leu Pro Ile Leu
85 90 95

Ile Asn Gln Leu Ser Ser Leu Ile Ile Ser Ser Gln Thr Ile Tyr Ser
100 105 110

Arg Val Gln Asp Leu Ile Ile Asp Leu Ser Asn Tyr Pro Ala Leu Gln
115 120 125

Asn Leu Asp Val Glu Ala Thr Ile Gln Gln Leu Asn Leu Ser Tyr Val
130 135 140

Asp Ile Leu Gln Asn Ile Leu Asn Ser Val Ser Asn Ser Val Gly Ser
145 150 155 160

Val Leu Ser Ala Leu Ile Ser Thr Val Leu Ile Leu Ile Met Thr Pro
165 170 175

Val Phe Leu Val Tyr Phe Leu Leu Asp Gly His Lys Phe Leu Pro Met
180 185 190

Leu Glu Arg Thr Ile Leu Lys Arg Asp Arg Leu His Ile Ala Gly Leu
195 200 205

Leu Lys Asn Leu Asn Ala Thr Ile Ala Arg Tyr Ile Ser Gly Val Ser
210 215 220

Ile Asp Ala Ile Ile Ile Gly Cys Leu Ala Tyr Ile Gly Tyr Ser Ile
225 230 235 240

Ile Gly Leu Lys Tyr Ala Leu Val Phe Ala Ile Phe Ser Gly Val Ala
245 250 255

Asn Leu Ile Pro Tyr Val Gly Pro Ser Ile Gly Leu Ile Pro Met Ile
260 265 270

Ile Ala Asn Ile Phe Thr Val Pro His Arg Leu Leu Ile Ala Val Ile
275 280 285

Tyr Met Leu Val Val Gln Gln Val Asp Gly Asn Ile Leu Tyr Pro Arg
290 295 300

Ile Val Gly Ser Val Met Lys Val His Pro Ile Thr Ile Leu Val Leu
305 310 315 320

Leu Leu Leu Ser Ser Asn Ile Tyr Gly Val Val Gly Met Ile Val Ala
325 330 335

Val Pro Thr Tyr Ser Ile Leu Lys Glu Ile Ser Lys Phe Leu Ser Arg
340 345 350

Leu Tyr Glu Asn His Lys Ile Met Lys Glu Arg Glu Arg Glu Leu Ala
355 360 365

Lys



<210> 190
<211> 451
<212> PRT

<213> Streptococcus pneumoniae
<400> 190

Met Tyr Gln Ala Leu Tyr Arg Lys Tyr Arg Ser Gln Asn Phe Ser Gln
1 5 10 15

Leu Val Gly Gln Glu Val Val Ala Lys Thr Leu Lys Gln Ala Val Glu
20 25 30

Gln Glu Lys Ile Ser His Ala Tyr Leu Phe Ser Gly Pro Arg Gly Thr
35 40 45

Gly Lys Thr Ser Val Ala Lys Ile Phe Ala Lys Ala Met Asn Cys Pro
50 55 60

Asn Gln Val Gly Gly Glu Pro Cys Asn Asn Cys Tyr Ile Cys Gln Ala
65 70 75 80

Val Thr Asp Gly Ser Leu Glu Asp Val Ile Glu Met Asp Ala Ala Ser
85 90 95

Asn Asn Gly Val Asp Glu Ile Arg Glu Ile Arg Asp Lys Ser Thr Tyr
100 105 110

Ala Pro Ser Leu Ala Arg Tyr Lys Val Tyr Ile Ile Asp Glu Val His
115 120 125

Met Leu Ser Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu
130 135 140

Pro Thr Gln Asn Val Val Phe Ile Leu Ala Thr Thr Glu Leu His Lys
145 150 155 160

Ile Pro Ala Thr Ile Leu Ser Arg Val Gln Arg Phe Glu Phe Lys Ser
165 170 175

Ile Lys Thr Gln Asp Ile Lys Glu His Ile His Tyr Ile Leu Glu Lys
180 185 190

Glu Asn Ile Ser Ser Glu Pro Glu Ala Val Glu Ile Ile Ala Arg Arg
195 200 205

Ala Glu Gly Gly Met Arg Asp Ala Leu Ser Ile Leu Asp Gln Ala Leu
210 215 220

Ser Leu Thr Gln Gly Asn Glu Leu Thr Thr Ala Ile Ser Glu Glu Ile
225 230 235 240

Thr Gly Thr Ile Ser Leu Ser Ala Leu Asp Asp Tyr Val Ala Ala Leu
245 250 255

Ser Gln Gln Asp Val Pro Lys Ala Leu Ser Cys Leu Asn Leu Leu Phe
260 265 270

Asp Asn Gly Lys Ser Met Thr Arg Phe Val Thr Asp Leu Leu His Tyr
275 280 285

Leu Arg Asp Leu Leu Ile Val Gln Thr Gly Gly Glu Asn Thr His His
290 295 300

Ser Ser Val Phe Val Glu Asn Leu Ala Leu Pro Gln Lys Asn Leu Phe
305 310 315 320

Glu Met Ile Arg Leu Ala Thr Val Asn Leu Ala Asp Ile Lys Ser Ser
325 330 335

Leu Gln Pro Lys Ile Tyr Ala Glu Met Met Thr Val Arg Leu Ala Glu
340 345 350

Ile Lys Pro Glu Pro Ala Leu Ser Gly Ala Val Glu Asn Glu Ile Ala
355 360 365

Thr Leu Arg Gln Glu Val Ala Arg Leu Lys Gln Glu Leu Ser Asn Ala
370 375 380

Gly Ala Val Pro Lys Gln Val Ala Pro Ala Pro Ser Arg Pro Ala Thr
385 390 395 400

Gly Lys Thr Val Tyr Arg Val Asp Arg Asn Lys Val Gln Ser Ile Leu
405 410 415

Gln Glu Ala Val Glu Asn Pro Asp Leu Thr Arg Gln Asn Leu Ile Arg
420 425 430

Leu Gln Asn Ala Trp Gly Glu Val Ile Glu Ser Leu Gly Gly Pro Asp
435 440 445

Lys Leu Cys
450



<210> 191 
<211> 662 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 191

Met Phe Arg Leu Thr Asn Lys Leu Ala Val Ser Asn Leu Ile Lys Asn
1 5 10 15

Arg Lys Leu Tyr Tyr Pro Phe Ala Leu Ala Val Leu Leu Ala Val Thr
20 25 30

Leu Thr Tyr Leu Phe Tyr Ser Leu Thr Phe Asn Pro Lys Ile Ala Glu
35 40 45

Ile Arg Gly Gly Thr Thr Ile Gln Ala Thr Leu Gly Phe Gly Met Phe
50 55 60

Val Val Thr Leu Ala Ser Ala Ile Ile Val Leu Tyr Ala Asn Ser Phe
65 70 75 80

Val Met Lys Lys Arg Ser Lys Glu Leu Gly Ile Tyr Gly Met Leu Gly
85 90 95

Leu Glu Lys Arg His Leu Ile Ser Met Thr Phe Lys Glu Leu Val Val
100 105 110

Phe Gly Ile Leu Thr Val Gly Ala Gly Ile Gly Ile Gly Ala Leu Phe
115 120 125

Asp Lys Leu Ile Phe Ala Phe Leu Leu Lys Leu Met Lys Leu Lys Val
130 135 140

Glu Leu Val Ala Thr Phe Gln Thr Lys Val Val Ile Thr Val Leu Val
145 150 155 160

Val Phe Gly Leu Ile Phe Leu Gly Leu Met Phe Leu Asn Ala Leu Arg
165 170 175

Ile Ala Arg Met Asn Ala Leu Gln Leu Ser Arg Glu Lys Ala Ser Gly
180 185 190

Glu Lys Lys Gly Arg Phe Leu Pro Leu Gln Thr Ile Leu Gly Ser Ile
195 200 205

Ser Leu Gly Ile Gly Tyr Tyr Leu Ala Leu Thr Val Lys Asp Pro Leu
210 215 220

Thr Ala Leu Thr Thr Phe Phe Ile Ala Val Leu Leu Val Ile Phe Gly
225 230 235 240

Thr Tyr Leu Leu Phe Asn Ala Gly Ile Thr Val Phe Leu Gln Ile Leu
245 250 255

Lys Lys Asn Lys Lys Tyr Tyr Tyr Gln Pro Asn Asn Leu Ile Ser Val
260 265 270

Ser Asn Leu Ile Phe Arg Met Lys Lys Asn Ala Val Gly Leu Ala Thr
275 280 285



Ile Ala Ile Leu Ser Thr Met Val Leu Val Thr Met Ser Ala Ala Thr
290 295 300

Ser Ile Phe Asn Ser Ala Glu Ser Phe Lys Lys Val Leu Asn Pro His
305 310 315 320

Asp Phe Gly Val Ser Gly Gln Asn Val Glu Lys Glu Asp Leu Asp Lys
325 330 335

Leu Leu Ser Gln Phe Ala Ser Asp Asn Gly Tyr Lys Ile Lys Glu Lys
340 345 350

Glu Val Phe Arg Tyr Thr Tyr Phe Gly Val Ala Asn Gln Glu Gly Asn
355 360 365

Lys Leu Thr Phe Phe Glu Lys Gly Gln Asn Arg Val Gln Pro Thr Thr
370 375 380

Val Phe Met Val Phe Asp Gln Lys Asp Tyr Glu Asn Met Thr Gly Gln
385 390 395 400

Lys Leu Ser Leu Ser Gly Asn Glu Val Gly Leu Phe Ala Lys Asn Asp
405 410 415

Gly Leu Lys Gly Gln Lys Thr Leu Ile Leu Asn Asp His Gln Phe Ser
420 425 430



Val Lys Glu Glu Phe Asn Lys Asp Phe Ile Val Asn His Val Pro Asn
435 440 445

Gln Phe Asn Ile Leu Thr Ala Asp Tyr Asn Tyr Leu Val Val Pro Asp
450 455 460

Leu Gln Ala Phe Leu Asn Gln Phe Pro Asp Ser Asp Ile Tyr Asn Gln
465 470 475 480

Phe Tyr Gly Gly Met Asn Val Asn Val Ser Glu Glu Glu Gln Leu Lys
485 490 495

Val Ala Glu Glu Tyr Glu Asn Tyr Leu Asn Gln Phe Asn Ala Gln Leu
500 505 510

Asp Thr Glu Gly Ser Tyr Val Tyr Gly Ser Asn Leu Ala Asp Ala Ser
515 520 525

Ser Gln Met Ser Ala Leu Phe Gly Gly Val Phe Phe Ile Gly Ile Phe
530 535 540

Leu Ser Ile Ile Phe Met Val Gly Thr Val Leu Val Ile Tyr Tyr Lys
545 550 555 560

Gln Ile Ser Glu Gly Tyr Glu Asp Arg Glu Arg Phe Ile Ile Leu Gln
565 570 575

Lys Val Gly Leu Asp Gln Lys Gln Ile Lys Gln Thr Ile His Lys Gln
580 585 590

Val Leu Thr Val Phe Phe Leu Pro Leu Leu Phe Ala Phe Ile His Leu
595 600 605

Ala Phe Ala Tyr His Met Leu Ser Leu Ile Leu Lys Val Ile Gly Val
610 615 620

Leu Asp Thr Thr Met Met Leu Ile Val Thr Leu Ser Ile Cys Ala Ile
625 630 635 640

Phe Leu Ile Ala Tyr Val Leu Ile Phe Met Ile Thr Ser Arg Ser Tyr
645 650 655

Arg Lys Ile Val Gln Met
660



<210> 192 
<211> 296
<212> PRT

<213> Streptococcus pneumoniae
<400> 192

Met Lys Gln Asp Gln Leu Lys Ala Trp Gln Pro Ala Gln Phe Asp Arg
1 5 10 15

Phe Val Arg Ile Leu Glu Gln Asp Gln Leu Asn His Ala Tyr Leu Phe
20 25 30

Ser Gly Phe Phe Gly Ser Leu Glu Met Ala Gln Phe Leu Ala Lys Ser
35 40 45

Leu Phe Cys Thr Asp Lys Val Gly Val Leu Pro Cys Glu Lys Cys Arg
50 55 60

Ser Cys Lys Leu Ile Glu Gln Glu Glu Phe Pro Asp Val Thr Leu Ile
65 70 75 80

Lys Pro Val Asn Gln Val Ile Lys Thr Glu Arg Ile Arg Glu Leu Val
85 90 95

Gly Gln Phe Ser Gln Ala Gly Ile Glu Ser Gln Gln Gln Val Phe Ile
100 105 110

Ile Glu Gln Ala Asp Lys Met His Pro Asn Ala Ala Asn Ser Leu Leu
115 120 125

Lys Val Ile Glu Glu Pro Gln Ser Glu Val Tyr Ile Phe Phe Leu Thr
130 135 140

Ser Asp Glu Glu Lys Met Leu Pro Thr Ile Arg Ser Arg Thr Gln Ile
145 150 155 160

Phe His Phe Lys Lys Gln Glu Glu Lys Leu Ile Leu Leu Leu Glu Gln
165 170 175



Met Gly Leu Val Lys Lys Lys Ala Thr Leu Leu Ala Lys Phe Ser Gln
180 185 190

Ser Arg Ala Glu Ala Glu Lys Leu Ala Asn Gln Ala Ser Phe Trp Thr
195 200 205

Leu Val Asp Glu Ser Glu Arg Leu Leu Thr Trp Leu Val Ala Lys Lys
210 215 220

Lys Glu Ser Tyr Leu Gln Val Ala Lys Leu Ala Asn Leu Ala Asp Asp
225 230 235 240

Lys Glu Lys Gln Asp Gln Val Leu Arg Ile Leu Glu Val Leu Cys Gly
245 250 255

Gln Asp Leu Leu Gln Val Arg Val Arg Val Ile Leu Gln Asp Leu Leu
260 265 270

Glu Ala Arg Lys Met Trp Gln Ala Asn Val Ser Phe Gln Asn Ala Met
275 280 285

Glu Tyr Leu Val Leu Lys Glu Ile
290 295



<210> 193 
<211> 204 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 193 

Met Asn Ser Phe Lys Asn Phe Leu Lys Glu Trp Gly Leu Phe Leu Leu 
1 5 10 15 

Ile Leu Ser Leu Leu Ala Leu Ser Arg Ile Phe Phe Trp Ser Asn Val
20 25 30

Arg Val Glu Gly His Ser Met Asp Pro Thr Leu Ala Asp Gly Glu Ile
35 40 45

Leu Phe Val Val Lys His Leu Pro Ile Asp Arg Phe Asp Ile Val Val
50 55 60

Ala His Glu Glu Asp Gly Asn Lys Asp Ile Val Lys Arg Val Ile Gly
65 70 75 80

Met Pro Gly Asp Thr Ile Arg Tyr Glu Asn Asp Lys Leu Tyr Ile Asn
85 90 95

Asp Lys Glu Thr Asp Glu Pro Tyr Leu Ala Asp Tyr Ile Lys Arg Phe
100 105 110

Lys Asp Asp Lys Leu Gln Ser Thr Tyr Ser Gly Lys Gly Phe Glu Gly
115 120 125

Asn Lys Gly Thr Phe Phe Arg Ser Ile Ala Gln Lys Ala Gln Ala Phe
130 135 140

Thr Val Asp Val Asn Tyr Asn Thr Asn Phe Ser Phe Thr Val Pro Glu
145 150 155 160

Gly Glu Tyr Leu Leu Leu Gly Asp Asp Arg Leu Val Ser Ser Asp Ser
165 170 175

Arg His Val Gly Thr Phe Lys Ala Lys Asp Ile Thr Gly Glu Ala Lys
180 185 190

Phe Arg Phe Trp Pro Ile Thr Arg Ile Gly Thr Phe
195 200



<210> 194
<211> 328
<212> PRT

<213> Streptococcus pneumoniae

<400> 194

Met Val Val Phe Thr Gly Ser Thr Val Glu Glu Ala Ile Gln Lys Gly
1 5 10 15

Leu Lys Glu Leu Asp Ile Pro Arg Met Lys Ala His Ile Lys Val Ile
20 25 30

Ser Arg Glu Lys Lys Gly Phe Leu Gly Leu Phe Gly Lys Lys Pro Ala
35 40 45

Gln Val Asp Ile Glu Ala Ile Ser Glu Thr Thr Val Val Lys Ala Asn
50 55 60

Gln Gln Val Val Lys Gly Val Pro Lys Lys Ile Asn Asp Leu Asn Glu
65 70 75 80

Pro Val Lys Thr Val Ser Glu Glu Thr Val Asp Leu Gly His Val Val
85 90 95

Asn Ala Ile Lys Lys Ile Glu Glu Glu Gly Gln Gly Ile Ser Asp Glu
100 105 110

Val Lys Ala Glu Ile Leu Lys His Glu Arg His Ala Ser Thr Ile Leu
115 120 125

Glu Glu Thr Gly His Ile Glu Ile Leu Asn Glu Leu Gln Ile Glu Glu
130 135 140

Ala Met Arg Glu Glu Ala Gly Ala Asp Asp Leu Glu Thr Glu Gln Asp
145 150 155 160

Gln Thr Glu Asn Gln Asp Leu Lys Glu Met Gly Leu Lys Val Glu Gln
165 170 175



Ser Tyr Asp Ile Ala Gln Val Ala Thr Asp Val Thr Ala Tyr Val Gln
180 185 190

Ala Ile Val Asp Asp Met Asp Val Glu Ala Thr Leu Ser Asn Asp Tyr
195 200 205

Asn Arg Arg Ser Ile Asn Leu Gln Ile Asp Thr Asn Glu Pro Gly Arg
210 215 220

Ile Ile Gly Tyr His Gly Lys Val Leu Lys Ala Leu Gln Leu Leu Ala
225 230 235 240

Gln Asn Tyr Leu Tyr Asn Arg Tyr Ser Lys Thr Phe Tyr Val Thr Ile
245 250 255

Asn Val Asn Asp Tyr Val Glu His Arg Ala Glu Val Leu Gln Thr Tyr
260 265 270

Ala Gln Lys Leu Ala Asn Arg Val Leu Glu Glu Gly Arg Ser His Lys
275 280 285

Thr Asp Pro Met Ser Asn Ser Glu Arg Lys Ile Ile His Arg Ile Ile
290 295 300

Ser Arg Met Asp Gly Val Thr Ser Tyr Ser Glu Gly Asp Glu Pro Asn
305 310 315 320

Arg Tyr Val Val Val Asp Thr Glu
325



<210> 195 
<211> 460 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 195 

Met Ser Asn Phe Ala Ile Ile Leu Ala Ala Gly Lys Gly Thr Arg Met
1 5 10 15

Lys Ser Asp Leu Pro Lys Val Leu His Lys Val Ala Gly Ile Ser Met
20 25 30

Leu Glu His Val Phe Arg Ser Val Gly Ala Ile Gln Pro Glu Lys Thr
35 40 45

Val Thr Val Val Gly His Lys Ala Glu Leu Val Glu Glu Val Leu Ala
50 55 60

Gly Gln Thr Glu Phe Val Thr Gln Ser Glu Gln Leu Gly Thr Gly His
65 70 75 80

Ala Val Met Met Thr Glu Pro Ile Leu Glu Gly Val Ser Gly His Thr
85 90 95

Leu Val Ile Ala Gly Asp Thr Pro Leu Ile Thr Gly Glu Ser Leu Lys
100 105 110

Asn Leu Ile Asp Phe His Ile Asn His Lys Asn Val Ala Thr Ile Leu
115 120 125

Thr Ala Glu Thr Asp Asn Pro Phe Gly Tyr Gly Arg Ile Val Arg Asn
130 135 140

Asp Asn Ala Glu Val Leu Arg Ser Leu Leu Ser Arg Arg Met Leu Gln
145 150 155 160

Ile Leu Lys Ser Lys Ser Arg Lys Ser Thr Leu Val Thr Tyr Val Phe
165 170 175

Asp Asn Glu Arg Leu Phe Glu Ala Leu Lys Asn Ile Asn Thr Asn Asn
180 185 190

Ala Gln Gly Glu Tyr Tyr Ile Thr Asp Val Ile Gly Ile Phe Arg Glu
195 200 205

Thr Gly Glu Lys Val Gly Ala Tyr Thr Leu Lys Asp Phe Asp Glu Ser
210 215 220

Leu Gly Val Asn Asp Arg Val Ala Leu Ala Thr Ala Glu Ser Val Met
225 230 235 240

Arg Arg Arg Ile Asn His Lys His Met Val Asn Gly Val Ser Phe Val
245 250 255

Asn Pro Glu Ala Thr Tyr Ile Asp Ile Asp Val Glu Ile Ala Pro Glu
260 265 270

Val Gln Ile Glu Ala Asn Val Ile Leu Lys Gly Gln Thr Lys Ile Gly
275 280 285

Ala Glu Thr Val Leu Thr Asn Gly Thr Tyr Val Val Asp Ser Thr Ile
290 295 300

Gly Ala Gly Ala Val Ile Thr Asn Ser Met Ile Glu Glu Ser Ser Val
305 310 315 320

Ala Asp Gly Val Thr Val Gly Pro Tyr Ala His Ile Arg Pro Asn Ser
325 330 335

Ser Leu Gly Ala Gln Val His Ile Gly Asn Phe Val Glu Val Lys Gly
340 345 350

Ser Ser Ile Gly Glu Asn Thr Lys Ala Gly His Leu Thr Tyr Ile Gly
355 360 365



Asn Cys Glu Val Gly Ser Asn Val Asn Phe Gly Ala Gly Thr Ile Thr
370 375 380

Val Asn Tyr Asp Gly Lys Asn Lys Tyr Lys Thr Val Ile Gly Val Asn
385 390 395 400

Val Phe Val Gly Ser Asn Ser Thr Ile Ile Ala Pro Val Glu Leu Gly
405 410 415

Asp Asn Ser Leu Val Gly Ala Gly Ser Thr Ile Thr Lys Asp Val Pro
420 425 430

Ala Asp Ala Ile Ala Ile Gly Arg Gly Arg Gln Ile Asn Lys Asp Glu
435 440 445

Tyr Ala Thr Arg Leu Pro His His Pro Lys Asn Gln
450 455 460






<210> 196 
<211> 311 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 196 

Met Ser Lys Ile Leu Val Phe Gly His Gln Asn Pro Asp Ser Asp Ala
1 5 10 15

Ile Gly Ser Ser Val Ala Phe Ala Tyr Leu Ala Lys Glu Ala Tyr Gly
20 25 30

Leu Asp Thr Glu Ala Val Ala Leu Gly Thr Pro Asn Glu Glu Thr Ala
35 40 45

Phe Val Leu Asn Tyr Phe Gly Val Glu Ala Pro Arg Val Ile Thr Ser
50 55 60

Ala Lys Ala Glu Gly Ala Glu Gln Val Ile Leu Thr Asp His Asn Glu
65 70 75 80

Phe Gln Gln Ser Val Ser Asp Ile Ala Glu Val Glu Val Tyr Gly Val
85 90 95

Val Asp His His Arg Val Ala Asn Phe Glu Thr Ala Ser Pro Leu Tyr
100 105 110

Met Arg Leu Glu Pro Val Gly Ser Ala Ser Ser Ile Val Tyr Arg Met
115 120 125

Phe Lys Glu His Gly Val Ala Val Pro Lys Glu Ile Ala Gly Leu Met
130 135 140

Leu Ser Gly Leu Ile Ser Asp Thr Leu Leu Leu Lys Ser Pro Thr Thr
145 150 155 160

His Pro Thr Asp Lys Ile Ile Ala Pro Glu Leu Ala Glu Leu Ala Gly
165 170 175

Val Asn Leu Glu Glu Tyr Gly Leu Ala Met Leu Lys Ala Gly Thr Asn
180 185 190

Leu Ala Ser Lys Ser Ala Glu Glu Leu Ile Asp Ile Asp Ala Lys Thr
195 200 205

Phe Glu Leu Asn Gly Asn Asn Val Arg Val Ala Gln Val Asn Thr Val
210 215 220

Asp Ile Ala Glu Val Leu Glu Arg Gln Ala Glu Ile Glu Ala Ala Met
225 230 235 240

Gln Ala Ala Asn Glu Ser Asn Gly Tyr Ser Asp Phe Val Leu Met Ile
245 250 255

Thr Asp Ile Val Asn Ser Asn Ser Glu Ile Leu Ala Leu Gly Ala Asn
260 265 270

Met Asp Lys Val Glu Ala Ala Phe Asn Phe Lys Leu Glu Asn Asn His
275 280 285

Ala Phe Leu Ala Gly Ala Val Ser Arg Lys Lys Gln Val Val Pro Gln
290 295 300

Leu Thr Glu Ser Phe Asn Thr
305 310



<210> 197
<211> 225
<212> PRT
<213> Streptococcus pneumoniae

<400> 197

Met Ile Ser Lys Arg Leu Glu Leu Val Ala Ser Phe Val Ser Gln Gly
1 5 10 15

Ala Ile Leu Leu Asp Val Gly Ser Asp His Ala Tyr Leu Pro Ile Glu
20 25 30

Leu Val Glu Arg Gly Gln Ile Lys Ser Ala Ile Ala Gly Glu Val Val
35 40 45

Glu Gly Pro Tyr Gln Ser Ala Val Lys Asn Val Glu Ala His Gly Leu
50 55 60

Lys Glu Lys Ile Gln Val Arg Leu Ala Asn Gly Leu Ala Ala Phe Glu
65 70 75 80

Glu Thr Asp Gln Val Ser Val Ile Thr Ile Ala Gly Met Gly Gly Arg
85 90 95

Leu Ile Ala Arg Ile Leu Glu Glu Gly Leu Gly Lys Leu Ala Asn Val
100 105 110

Glu Arg Leu Ile Leu Gln Pro Asn Asn Arg Glu Asp Asp Leu Arg Ile
115 120 125

Trp Leu Gln Asp His Gly Phe Gln Ile Val Ala Glu Ser Ile Leu Glu
130 135 140

Glu Ala Gly Lys Phe Tyr Glu Ile Leu Val Val Glu Ala Gly Gln Met
145 150 155 160

Lys Leu Ser Ala Ser Asp Val Arg Phe Gly Pro Phe Leu Ser Lys Glu
165 170 175

Val Ser Pro Val Phe Val Gln Lys Trp Gln Lys Glu Ala Glu Lys Leu
180 185 190

Glu Phe Ala Leu Gly Gln Ile Pro Glu Lys Asn Leu Glu Glu Arg Gln
195 200 205

Val Leu Val Asp Lys Ile Gln Ala Ile Lys Glu Val Leu His Val Ser
210 215 220

Lys
225



<210> 198 
<211> 161 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 198 

Met Asn Leu Asn Asp Ile Lys Asp Leu Met Thr Gln Phe Asp Gln Ser
1 5 10 15

Ser Leu Arg Glu Phe Ser Tyr Lys Asn Gly Thr Asp Glu Leu Gln Phe
20 25 30

Ser Lys Asn Glu Ala Arg Pro Val Pro Glu Val Ala Thr Gln Val Ala
35 40 45

Pro Ala Pro Val Leu Ala Thr Pro Ser Pro Val Ala Pro Thr Ser Ala 
50 55 60 

Pro Ala Glu Thr Val Ala Glu Glu Val Pro Ala Pro Ala Glu Ala Ser 
65 70 75 80 

Val Ala Ser Glu Gly Asn Leu Val Glu Ser Pro Leu Val Gly Val Val 
85 90 95 

Tyr Leu Ala Ala Gly Pro Asp Lys Pro Ala Phe Val Thr Val Gly Asp 
100 105 110 

Ser Val Lys Lys Gly Gln Thr Leu Val Ile Ile Glu Ala Met Lys Val
115 120 125

Met Asn Glu Ile Pro Ala Pro Lys Asp Gly Val Val Thr Glu Ile Leu
130 135 140

Val Ser Asn Glu Glu Met Val Glu Phe Gly Lys Gly Leu Val Arg Ile
145 150 155 160

Lys 



<210> 199 
<211> 411 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 199

Met Lys Leu Asn Arg Val Val Val Thr Gly Tyr Gly Val Thr Ser Pro
1 5 10 15

Ile Gly Asn Thr Pro Glu Glu Phe Trp Asn Ser Leu Ala Thr Gly Lys
20 25 30

Ile Gly Ile Gly Gly Ile Thr Lys Phe Asp His Ser Asp Phe Asp Val
35 40 45

His Asn Ala Ala Glu Ile Gln Asp Phe Pro Phe Asp Lys Tyr Phe Val
50 55 60

Lys Lys Asp Thr Asn Arg Phe Asp Asn Tyr Ser Leu Tyr Ala Leu Tyr
65 70 75 80

Ala Ala Gln Glu Ala Val Asn His Ala Asn Leu Asp Val Glu Ala Leu
85 90 95

Asn Arg Asp Arg Phe Gly Val Ile Val Ala Ser Gly Ile Gly Gly Ile
100 105 110

Lys Glu Ile Glu Asp Gln Val Leu Arg Leu His Glu Lys Gly Pro Lys
115 120 125

Arg Val Lys Pro Met Thr Leu Pro Lys Ala Leu Pro Asn Met Ala Ser
130 135 140

Gly Asn Val Ala Met Arg Phe Gly Ala Asn Gly Val Cys Lys Ser Ile
145 150 155 160

Asn Thr Ala Cys Ser Ser Ser Asn Asp Ala Ile Gly Asp Ala Phe Arg
165 170 175

Ser Ile Lys Phe Gly Phe Gln Asp Val Met Leu Val Gly Gly Thr Glu
180 185 190

Ala Ser Ile Thr Pro Phe Ala Ile Ala Gly Phe Gln Ala Leu Thr Ala
195 200 205

Leu Ser Thr Thr Glu Asp Pro Thr Arg Ala Ser Ile Pro Phe Asp Lys
210 215 220

Asp Arg Asn Gly Phe Val Met Gly Glu Gly Ser Gly Met Leu Val Leu
225 230 235 240

Glu Ser Leu Glu His Ala Glu Lys Arg Gly Ala Thr Ile Leu Ala Glu
245 250 255

Val Val Gly Tyr Gly Asn Thr Cys Asp Ala Tyr His Met Thr Ser Pro
260 265 270

His Pro Glu Gly Gln Gly Ala Ile Lys Ala Ile Lys Leu Ala Leu Glu
275 280 285

Glu Ala Glu Ile Ser Pro Glu Gln Val Ala Tyr Val Asn Ala His Gly
290 295 300

Thr Ser Thr Pro Ala Asn Glu Lys Gly Glu Ser Gly Ala Ile Val Ala
305 310 315 320

Val Leu Gly Lys Glu Val Pro Val Ser Ser Thr Lys Ser Phe Thr Gly
325 330 335

His Leu Leu Gly Ala Ala Gly Ala Val Glu Ala Ile Val Thr Ile Glu
340 345 350

Ala Met Arg His Asn Phe Val Pro Met Thr Ala Gly Thr Ser Glu Val
355 360 365

Ser Asp Tyr Ile Glu Ala Asn Val Val Tyr Gly Gln Gly Leu Glu Lys
370 375 380

Glu Ile Pro Tyr Ala Ile Ser Asn Thr Phe Gly Phe Gly Gly His Asn
385 390 395 400

Ala Val Leu Ala Phe Lys Arg Trp Glu Asn Arg
405 410



<210> 200 
<211> 359 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 200 

Met Asn Ile Tyr Asp Gln Leu Gln Val Val Glu Asp Arg Tyr Glu Glu
1 5 10 15

Leu Gly Glu Leu Leu Ser Asp Pro Asp Val Val Ser Asp Thr Lys Arg
20 25 30

Phe Met Glu Leu Ser Lys Glu Glu Ala Ser Asn Arg Asp Thr Val Ile
35 40 45

Ala Tyr Arg Glu Tyr Lys Gln Val Leu Gln Asn Ile Val Asp Ala Glu
50 55 60

Glu Met Ile Lys Glu Ser Gly Gly Asp Ala Asp Leu Glu Glu Leu Ala
65 70 75 80

Lys Gln Glu Leu Lys Asp Ala Lys Ala Glu Lys Glu Glu Tyr Glu Glu
85 90 95

Lys Leu Lys Ile Leu Leu Leu Pro Lys Asp Pro Asn Asp Asp Lys Asn
100 105 110

Ile Ile Leu Glu Ile Arg Gly Ala Ala Gly Gly Asp Glu Ala Ala Leu
115 120 125

Phe Ala Gly Asp Leu Leu Thr Met Tyr Gln Lys Tyr Ala Glu Ala Gln
130 135 140

Gly Trp Arg Phe Glu Val Met Glu Ala Ser Met Asn Gly Val Gly Gly
145 150 155 160

Phe Lys Glu Val Val Ala Met Val Ser Gly Gln Ser Val Tyr Ser Lys
165 170 175

Leu Lys Tyr Glu Ser Gly Ala His Arg Val Gln Arg Val Pro Val Thr
180 185 190

Glu Ser Gln Gly Arg Val His Thr Ser Thr Ala Thr Val Leu Val Met
195 200 205

Pro Glu Val Glu Glu Val Glu Tyr Asp Ile Asp Pro Lys Asp Leu Arg
210 215 220

Val Asp Ile Tyr His Ala Ser Gly Ala Gly Gly Gln Asn Val Asn Lys
225 230 235 240

Val Ala Thr Ala Val Arg Ile Val His Leu Pro Thr Asn Ile Lys Val
245 250 255

Glu Met Gln Glu Glu Arg Thr Gln Gln Lys Asn Arg Glu Lys Ala Met
260 265 270

Lys Ile Ile Arg Ala Arg Val Ala Asp His Phe Ala Gln Ile Ala Gln
275 280 285

Asp Glu Gln Asp Ala Glu Arg Lys Ser Thr Ile Gly Thr Gly Asp Arg
290 295 300

Ser Glu Arg Ile Arg Thr Tyr Asn Phe Pro Gln Asn Arg Val Thr Asp
305 310 315 320

His Arg Ile Gly Leu Thr Leu Gln Lys Leu Asp Thr Ile Leu Ser Gly
325 330 335

Lys Leu Asp Glu Val Val Asp Ala Leu Val Leu Tyr Asp Gln Thr Gln
340 345 350

Lys Leu Glu Glu Leu Asn Lys
355



<210> 201 
<211> 559 
.40 - <212> PRT 

<213> Streptococcus pneumoniae 

<400> 201 

Met Ala Tyr Thr Leu Lys Pro Glu Glu Val Gly Val Phe Ala Ile Gly
1 5 10 15

Gly Leu Gly Glu Ile Gly Lys Asn Thr Tyr Gly Ile Glu Tyr Gln Asp
20 25 30

Glu Ile Ile Ile Val Asp Ala Gly Ile Lys Phe Pro Glu Asp Asp Leu
35 40 45

Leu Gly Ile Asp Tyr Val Ile Pro Asp Tyr Ser Tyr Ile Val Asp Asn
50 55 60

Ile Asp Arg Val Lys Ala Val Leu Ile Thr His Gly His Glu Asp His
65 70 75 80

Ile Gly Gly Ile Pro Phe Leu Leu Lys Gln Ala Asn Val Pro Ile Tyr
85 90 95

Ala Gly Pro Leu Ala Leu Ala Leu Ile Arg Gly Lys Leu Glu Glu His
100 105 110

Gly Leu Leu Arg Asn Ala Lys Leu Tyr Glu Ile Asn His Asn Thr Glu
115 120 125

Leu Thr Phe Lys Asn Leu Lys Ala Thr Phe Phe Arg Thr Thr His Ser
130 135 140

Ile Pro Glu Pro Leu Gly Ile Val Ile His Thr Pro Gln Gly Lys Ile
145 150 155 160

Val Cys Thr Gly Asp Phe Lys Phe Asp Phe Thr Pro Val Gly Glu Pro
165 170 175

Ala Asp Leu His Arg Met Ala Ala Leu Gly Glu Glu Gly Val Leu Cys
180 185 190

Leu Leu Ser Asp Ser Thr Asn Ala Glu Val Pro Thr Phe Thr Asn Ser
195 200 205

Glu Lys Val Val Gly Gln Ser Ile Met Lys Ile Ile Gln Gly Ile Glu
210 215 220

Gly Arg Ile Ile Phe Ala Ser Phe Ala Ser Asn Ile Phe Arg Leu Gln
225 230 235 240

Gln Ala Thr Glu Ala Ala Val Lys Thr Gly Arg Lys Ile Ala Val Phe
245 250 255

Gly Arg Ser Met Glu Lys Ala Ile Val Asn Gly Ile Asp Leu Gly Tyr
260 265 270

Ile Lys Ala Pro Lys Gly Thr Phe Ile Glu Pro Asn Glu Ile Lys Asp
275 280 285

Tyr Pro Ala Gly Glu Val Leu Ile Leu Cys Thr Gly Ser Gln Gly Glu
290 295 300

Pro Met Ala Ala Leu Ser Arg Ile Ala Asn Gly Thr His Arg Gln Val
305 310 315 320

Gln Leu Gln Pro Gly Asp Thr Val Ile Phe Ser Ser Ser Pro Ile Pro
325 330 335

Gly Asn Thr Thr Ser Val Asn Lys Leu Ile Asn Ile Ile Ser Glu Ala
340 345 350

Gly Val Glu Val Ile His Gly Lys Val Asn Asn Ile His Thr Ser Gly
355 360 365

His Gly Gly Gln Gln Glu Gln Lys Leu Met Leu Cys Leu Ile Lys Pro
370 375 380

Lys Tyr Phe Met Pro Val His Gly Glu Tyr Arg Met Gln Lys Val His
385 390 395 400

Ala Gly Leu Ala Val Asp Thr Gly Val Glu Lys Asp Asn Ile Phe Ile
405 410 415

Met Ser Asn Gly Asp Val Leu Ala Leu Thr Ala Asp Ser Ala Arg Ile
420 425 430

Ala Gly His Phe Asn Ala Gln Asp Ile Tyr Val Asp Gly Asn Arg Ile
435 440 445

Gly Glu Ile Gly Ala Ala Val Leu Lys Asp Arg Arg Asp Leu Ser Glu
450 455 460

Asp Gly Val Val Leu Ala Val Ala Thr Val Asp Phe Lys Ser Gln Met
465 470 475 480

Ile Leu Ser Gly Pro Asp Ile Leu Ser Arg Gly Phe Val Tyr Met Arg
485 490 495

Glu Ser Gly Asp Leu Ile Arg Gln Ser Gln Arg Ile Leu Phe Asn Ala
500 505 510

Ile Arg Ile Ala Leu Lys Asn Lys Asp Ala Ser Val Gln Ser Val Asn
515 520 525

Gly Ala Ile Val Asn Ala Ile Arg Pro Phe Leu Tyr Glu Asn Thr Glu
530 535 540

Arg Glu Pro Ile Ile Ile Pro Met Ile Leu Thr Pro Asp Glu Glu
545 550 555

<210> 202 
<211> 450 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 202 

Met Ala Glu Val Glu Glu Leu Arg Val Gln Pro Gln Asp Ile Leu Ala
1 5 10 15

Glu Gln Ser Val Leu Gly Ala Ile Phe Ile Asp Glu Ser Lys Leu Val
20 25 30

Phe Val Arg Glu Tyr Ile Glu Ser Arg Asp Phe Phe Lys Tyr Ala His
35 40 45

Arg Leu Ile Phe Gln Ala Met Val Asp Leu Ser Asp Arg Gly Asp Ala
50 55 60

Ile Asp Ala Thr Thr Val Arg Thr Ile Leu Asp Asn Gln Gly Asp Leu
65 70 75 80

Gln Asn Ile Gly Gly Leu Ser Tyr Leu Val Glu Ile Val Asn Ser Val
85 90 95
Pro Thr Ser Ala Asn Ala Glu Tyr Tyr Ala Lys Ile Val Ala Glu Lys
100 105 110

Ala Met Leu Arg Arg Leu Ile Ala Lys Leu Thr Glu Ser Val Asn Gln
115 120 125

Ala Tyr Glu Ala Ser Gln Pro Ala Asp Glu Ile Ile Ala Gln Ala Glu
130 135 140

Lys Gly Leu Ile Asp Val Ser Glu Asn Ala Asn Arg Ser Gly Phe Lys
145 150 155 160

Asn Ile Arg Asp Val Leu Asn Leu Asn Phe Gly Asn Leu Glu Ala Arg
165 170 175

Ser Gln Gln Thr Thr Asp Ile Thr Gly Ile Ala Thr Gly Tyr Arg Asp
180 185 190

Leu Asp His Met Thr Thr Gly Leu His Glu Glu Glu Leu Ile Ile Leu
195 200 205

Ala Ala Arg Pro Ala Val Gly Lys Thr Ala Phe Ala Leu Asn Ile Ala
210 215 220

Gln Asn Ile Gly Thr Lys Leu Asp Lys Thr Val Ala Ile Phe Ser Leu
225 230 235 240

Glu Met Gly Ala Glu Ser Leu Val Asp Arg Met Leu Ala Ala Glu Gly
245 250 255

Leu Val Glu Ser His Ser Ile Arg Thr Gly Gln Leu Thr Asp Glu Glu
260 265 270

Trp Gln Lys Tyr Thr Ile Ala Gln Gly Asn Leu Ala Asn Ala Ser Ile
275 280 285

Tyr Ile Asp Asp Thr Pro Gly Ile Arg Ile Thr Glu Ile Arg Ser Arg
290 295 300

Ser Arg Lys Leu Ala Gln Glu Thr Gly Asn Leu Gly Leu Ile Val Ile
305 310 315 320

Asp Tyr Leu Gln Leu Ile Thr Gly Thr Gly Arg Glu Asn Arg Gln Gln
325 330 335

Glu Val Ser Glu Ile Ser Arg Gln Leu Lys Ile Leu Ala Lys Glu Leu
340 345 350

Lys Val Pro Val Ile Ala Leu Ser Gln Leu Ser Arg Gly Val Glu Gln
355 360 365

Arg Gln Asp Lys Arg Pro Val Leu Ser Asp Ile Arg Glu Ser Gly Ser
370 375 380

Ile Glu Gln Asp Ala Asp Ile Val Ala Phe Leu Tyr Arg Asp Asp Tyr
385 390 395 400

Tyr Glu Arg Gly Gly Glu Glu Glu Glu Gly Ile Pro Asn Asn Lys Val
405 410 415

Glu Val Ile Ile Glu Lys Asn Arg Ser Gly Ala Arg Gly Thr Val Glu
420 425 430

Leu Ile Val Gln Lys Glu Tyr Asn Lys Phe Ser Ser Ile Ser Lys Arg
435 440 445

Glu Ala 
450 



<210> 203 
<211> 699 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 203 

Met Ala Thr Ala Thr Lys Lys Lys Lys Ser Thr Val Lys Lys Asn Leu
1 5 10 15

Val Ile Val Glu Ser Pro Ala Lys Ala Lys Thr Ile Glu Lys Tyr Leu
20 25 30

Gly Arg Asn Tyr Lys Val Leu Ala Ser Val Gly His Ile Arg Asp Leu
35 40 45

Lys Lys Ser Ser Met Ser Val Asp Ile Glu Asn Asn Tyr Glu Pro Gln
50 55 60

Tyr Ile Asn Ile Arg Gly Lys Gly Pro Leu Ile Asn Asp Leu Lys Lys
65 70 75 80

Glu Ala Lys Lys Ala Asn Lys Val Phe Leu Ala Ser Asp Pro Asp Arg
85 90 95

Glu Gly Glu Ala Ile Ser Trp His Leu Ala His Ile Leu Asn Leu Asp
100 105 110

Glu Asn Asp Ala Asn Arg Val Val Phe Asn Glu Ile Thr Lys Asp Ala
115 120 125

Val Lys Asn Ala Phe Lys Glu Pro Arg Lys Ile Asp Met Asp Leu Val
130 135 140

Asp Ala Gln Gln Ala Arg Arg Ile Leu Asp Arg Leu Val Gly Tyr Ser
145 150 155 160

Ile Ser Pro Ile Leu Trp Lys Lys Val Lys Lys Gly Leu Ser Ala Gly
165 170 175

Arg Val Gln Ser Ile Ala Leu Lys Leu Ile Ile Asp Arg Glu Asn Glu
180 185 190

Ile Asn Ala Phe Gln Pro Glu Glu Tyr Trp Thr Val Asp Ala Val Phe
195 200 205

Lys Lys Gly Thr Lys Gln Phe His Ala Ser Phe Tyr Gly Val Asp Gly
210 215 220

Lys Lys Met Lys Leu Thr Ser Asn Asn Glu Val Lys Glu Val Leu Ser
225 230 235 240

Arg Leu Thr Ser Lys Asp Phe Ser Val Asp Gln Val Asp Lys Lys Glu
245 250 255

Arg Lys Arg Asn Ala Pro Leu Pro Tyr Thr Thr Ser Ser Met Gln Met
260 265 270

Asp Ala Ala Asn Lys Ile Asn Phe Arg Thr Arg Lys Thr Met Met Val
275 280 285

Ala Gln Gln Leu Tyr Glu Gly Ile Asn Ile Gly Ser Gly Val Gln Gly
290 295 300

Leu Ile Thr Tyr Met Arg Thr Asp Ser Thr Arg Ile Ser Pro Val Ala
305 310 315 320

Gln Asn Glu Ala Ala Ser Phe Ile Thr Asp Arg Phe Gly Ser Lys Tyr
325 330 335

Ser Lys His Gly Ser Lys Val Lys Asn Ala Ser Gly Ala Gln Asp Ala
340 345 350

His Glu Ala Ile Arg Pro Ser Ser Val Phe Asn Thr Pro Glu Ser Ile
355 360 365

Ala Lys Tyr Leu Asp Lys Asp Gln Leu Lys Leu Tyr Thr Leu Ile Trp
370 375 380

Asn Arg Phe Val Ala Ser Gln Met Thr Ala Ala Val Phe Asp Thr Met
385 390 395 400

Ala Val Lys Leu Ser Gln Lys Gly Val Gln Phe Ala Ala Asn Gly Ser
405 410 415

Gln Val Lys Phe Asp Gly Tyr Leu Ala Ile Tyr Asn Asp Ser Asp Lys
420 425 430

Asn Lys Met Leu Pro Asp Met Val Val Gly Asp Val Val Lys Gln Val
435 440 445

Asn Ser Lys Pro Glu Gln His Phe Thr Gln Pro Pro Ala Arg Tyr Ser
450 455 460

Glu Ala Thr Leu Ile Lys Thr Leu Glu Glu Asn Gly Val Gly Arg Pro
465 470 475 480



Ser Thr Tyr Ala Pro Thr Ile Glu Thr Ile Gln Lys Arg Tyr Tyr Val
485 490 495

Arg Leu Ala Ala Lys Arg Phe Glu Pro Thr Glu Leu Gly Glu Ile Val
500 505 510

Asn Lys Leu Ile Val Glu Tyr Phe Pro Asp Ile Val Asn Val Thr Phe
515 520 525

Thr Ala Glu Met Glu Gly Lys Leu Asp Asp Val Glu Val Gly Lys Glu
530 535 540

Gln Trp Arg Arg Val Ile Asp Ala Phe Tyr Lys Pro Phe Ser Lys Glu
545 550 555 560

Val Ala Lys Ala Glu Glu Glu Met Glu Lys Ile Gln Ile Lys Asp Glu
565 570 575

Pro Ala Gly Phe Asp Cys Glu Val Cys Gly Ser Pro Met Val Ile Lys
580 585 590

Leu Gly Arg Phe Gly Lys Phe Tyr Ala Cys Ser Asn Phe Pro Asp Cys
595 600 605

Arg His Thr Gln Ala Ile Val Lys Glu Ile Gly Val Glu Cys Pro Ser
610 615 620

Cys His Gln Gly Gln Ile Ile Glu Arg Lys Thr Lys Arg Asn Arg Leu
625 630 635 640

Phe Tyr Gly Cys Asn Arg Tyr Pro Glu Cys Glu Phe Thr Ser Trp Asp
645 650 655

Lys Pro Val Gly Arg Asp Cys Pro Lys Cys Gly Asn Phe Leu Met Glu
660 665 670

Lys Lys Val Arg Gly Gly Gly Lys Gln Val Val Cys Ser Lys Gly Asp
675 680 685

Tyr Glu Glu Glu Lys Met Ala Leu Cys Gln Leu
690 695



<210> 204 
<211> 326 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 204 

Met Phe Ile Ser Ile Ser Ala Gly Ile Val Thr Phe Leu Leu Thr Leu
1 5 10 15

Val Gly Ile Pro Ala Phe Ile Gln Phe Tyr Arg Lys Ala Gln Ile Thr
20 25 30

Gly Gln Gln Met His Glu Asp Val Lys Gln His Gln Ala Lys Ala Gly
35 40 45

Thr Pro Thr Met Gly Gly Leu Val Phe Leu Ile Thr Ser Val Leu Val
50 55 60



Ala Phe Phe Phe Ala Leu Phe Ser Ser Gln Phe Ser Asn Asn Val Gly
65 70 75 80

Met Ile Leu Phe Ile Leu Val Leu Tyr Gly Leu Val Gly Phe Leu Asp
85 90 95

Asp Phe Leu Lys Val Phe Arg Lys Ile Asn Glu Gly Leu Asn Pro Lys
100 105 110

Gln Lys Leu Ala Leu Gln Leu Leu Gly Gly Val Ile Phe Tyr Leu Phe
115 120 125

Tyr Glu Arg Gly Gly Asp Met Leu Ser Val Phe Gly Tyr Gln Val His
130 135 140

Leu Gly Ile Phe Tyr Ile Val Phe Ala Leu Phe Trp Leu Val Gly Phe
145 150 155 160

Ser Asn Ala Val Asn Leu Thr Asp Gly Val Asp Gly Leu Ala Ser Ile
165 170 175

Ser Val Val Ile Ser Leu Ser Ala Tyr Gly Val Ile Ala Tyr Val Gln
180 185 190

Gly Gln Met Asp Ile Leu Leu Val Ile Leu Ala Met Ile Gly Gly Leu
195 200 205

Leu Ser Phe Phe Ile Phe Asn His Lys Pro Ala Lys Ile Phe Met Gly
210 215 220

Asp Val Gly Ser Leu Ala Leu Gly Gly Met Leu Ala Ala Ile Ser Met
225 230 235 240

Ala Leu His Gln Glu Trp Thr Leu Leu Ile Ile Gly Ile Val Tyr Val
245 250 255

Phe Glu Thr Thr Ser Val Met Met Gln Val Ser Tyr Phe Lys Leu Thr
260 265 270

Gly Gly Lys Arg Ile Phe Arg Met Thr Pro Val His His His Phe Glu
275 280 285

Leu Gly Gly Leu Ser Gly Lys Gly Asn Pro Trp Ser Glu Trp Lys Val
290 295 300

Asp Phe Phe Phe Trp Gly Val Gly Leu Leu Ala Ser Leu Leu Thr Leu
305 310 315 320

Ala Ile Leu Tyr Leu Met
325



<210> 205
<211> 693
<212> PRT
<213> Streptococcus pneumoniae
<400> 205

Met Ala Arg Glu Phe Ser Leu Glu Lys Thr Arg Asn Ile Gly Ile Met
1 5 10 15

Ala His Val Asp Ala Gly Lys Thr Thr Thr Thr Glu Arg Ile Leu Tyr
20 25 30

Tyr Thr Gly Lys Ile His Lys Ile Gly Glu Thr His Glu Gly Ala Ser
35 40 45

Gln Met Asp Trp Met Glu Gln Glu Gln Glu Arg Gly Ile Thr Ile Thr
50 55 60

Ser Ala Ala Thr Thr Ala Gln Trp Asn Asn His Arg Val Asn Ile Ile
65 70 75 80

Asp Thr Pro Gly His Val Asp Phe Thr Ile Glu Val Gln Arg Ser Leu
85 90 95

Arg Val Leu Asp Gly Ala Val Thr Val Leu Asp Ser Gln Ser Gly Val
100 105 110

Glu Pro Gln Thr Glu Thr Val Trp Arg Gln Ala Thr Glu Tyr Gly Val
115 120 125

Pro Arg Ile Val Phe Ala Asn Lys Met Asp Lys Ile Gly Ala Asp Phe
130 135 140

Leu Tyr Ser Val Ser Thr Leu His Asp Arg Leu Gln Ala Asn Ala His
145 150 155 160

Pro Ile Gln Leu Pro Ile Gly Ser Glu Asp Asp Phe Arg Gly Ile Ile
165 170 175

Asp Leu Ile Lys Met Lys Ala Glu Ile Tyr Thr Asn Asp Leu Gly Thr
180 185 190

Asp Ile Leu Glu Glu Asp Ile Pro Ala Glu Tyr Leu Asp Gln Ala Gln
195 200 205

Glu Tyr Arg Glu Lys Leu Ile Glu Ala Val Ala Glu Thr Asp Glu Glu
210 215 220

Leu Met Met Lys Tyr Leu Glu Gly Glu Glu Ile Thr Asn Glu Glu Leu
225 230 235 240

Lys Ala Gly Ile Arg Lys Ala Thr Ile Asn Val Glu Phe Phe Pro Val
245 250 255

Leu Cys Gly Ser Ala Phe Lys Asn Lys Gly Val Gln Leu Met Leu Asp
260 265 270

Ala Val Ile Asp Tyr Leu Pro Ser Pro Leu Asp Ile Pro Ala Ile Lys
275 280 285

Gly Ile Asn Pro Asp Thr Asp Ala Glu Glu Ile Arg Pro Ala Ser Asp
290 295 300

Glu Glu Pro Phe Ala Ala Leu Ala Phe Lys Ile Met Thr Asp Pro Phe
305 310 315 320

Val Gly Arg Leu Thr Phe Phe Arg Val Tyr Ser Gly Val Leu Gln Ser
325 330 335

Gly Ser Tyr Val Leu Asn Thr Ser Lys Gly Lys Arg Glu Arg Ile Gly
340 345 350

Arg Ile Leu Gln Met His Ala Asn Ser Arg Gln Glu Ile Asp Thr Val
355 360 365

Tyr Ser Gly Asp Ile Ala Ala Ala Val Gly Leu Lys Asp Thr Thr Thr
370 375 380

Gly Asp Ser Leu Thr Asp Glu Lys Ala Lys Ile Ile Leu Glu Ser Ile
385 390 395 400

Asn Val Pro Glu Pro Val Ile Gln Leu Met Val Glu Pro Lys Ser Lys
405 410 415

Ala Asp Gln Asp Lys Met Gly Ile Ala Leu Gln Lys Leu Ala Glu Glu
420 425 430

Asp Pro Thr Phe Arg Val Glu Thr Asn Val Glu Thr Gly Glu Thr Val
435 440 445

Ile Ser Gly Met Gly Glu Leu His Leu Asp Val Leu Val Asp Arg Met
450 455 460

Arg Arg Glu Phe Lys Val Glu Ala Asn Val Gly Ala Pro Gln Val Ser
465 470 475 480

Tyr Arg Glu Thr Phe Arg Ala Ser Thr Gln Ala Arg Gly Phe Phe Lys
485 490 495

Arg Gln Ser Gly Gly Lys Gly Gln Phe Gly Asp Val Trp Ile Glu Phe
500 505 510

Thr Pro Asn Glu Glu Gly Lys Gly Phe Glu Phe Glu Asn Ala Ile Val
515 520 525

Gly Gly Val Val Pro Arg Glu Phe Ile Pro Ala Val Glu Lys Gly Leu
530 535 540

Val Glu Ser Met Ala Asn Gly Val Leu Ala Gly Tyr Pro Met Val Asp
545 550 555 560

Val Lys Ala Lys Leu Tyr Asp Gly Ser Tyr His Asp Val Asp Ser Ser
565 570 575

Glu Thr Ala Phe Lys Ile Ala Ala Ser Leu Ser Leu Lys Glu Ala Ala
580 585 590

Lys Ser Ala Gln Pro Ala Ile Leu Glu Pro Met Met Leu Val Thr Ile
595 600 605

Thr Val Pro Glu Glu Asn Leu Gly Asp Val Met Gly His Val Thr Ala
610 615 620

Arg Arg Gly Arg Val Asp Gly Met Glu Ala His Gly Asn Ser Gln Ile
625 630 635 640

Val Arg Ala Tyr Val Pro Leu Ala Glu Met Phe Gly Tyr Ala Thr Val
645 650 655

Leu Arg Ser Ala Ser Gln Gly Arg Gly Thr Phe Met Met Val Phe Asp
660 665 670

His Tyr Glu Asp Val Pro Lys Ser Val Gln Glu Glu Ile Ile Lys Lys
675 680 685

Asn Lys Gly Glu Asp 
690 



<210> 206 
<211> 408 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 206 

Met Pro Asn Tyr Asn Ile Pro Phe Ser Pro Pro Asp Ile Thr Glu Ala
1 5 10 15

Glu Ile Ala Glu Val Ala Asp Thr Leu Arg Ser Gly Trp Ile Thr Thr
20 25 30

Gly Pro Lys Thr Lys Glu Leu Glu Arg Arg Leu Ser Leu Tyr Thr Gln
35 40 45

Thr Pro Lys Thr Val Cys Leu Asn Ser Ala Thr Ala Ala Leu Glu Leu
50 55 60

Ile Leu Arg Val Leu Glu Val Gly Pro Gly Asp Glu Val Ile Val Pro
65 70 75 80

Ala Met Thr Tyr Thr Ala Ser Cys Ser Val Ile Thr His Val Gly Ala
85 90 95

Thr Pro Val Met Val Asp Ile Gln Ala Asp Thr Phe Glu Met Asp Tyr
100 105 110

Asp Leu Leu Glu Gln Ala Ile Thr Glu Lys Thr Lys Val Ile Ile Pro
115 120 125

Val Glu Leu Ala Gly Ile Val Cys Asp Tyr Asp Arg Leu Phe Gln Val
130 135 140

Val Glu Lys Lys Arg Asp Phe Phe Thr Ala Ser Ser Lys Trp Gln Lys
145 150 155 160

Ala Phe Asn Arg Ile Val Ile Val Ser Asp Ser Ala His Ala Leu Gly
165 170 175

Ser Thr Tyr Lys Gly Gln Pro Ser Gly Ser Ile Ala Asp Phe Thr Ser
180 185 190

Phe Ser Phe His Ala Val Lys Asn Phe Thr Thr Ala Glu Gly Gly Ser
195 200 205

Ala Thr Trp Lys Ala Asn Pro Val Ile Asp Asp Glu Glu Met Tyr Lys
210 215 220

Glu Phe Gln Ile Leu Ser Leu His Gly Gln Thr Lys Asp Ala Leu Ala
225 230 235 240

Lys Met Gln Leu Gly Ser Trp Glu Tyr Asp Ile Val Thr Pro Ala Tyr
245 250 255

Lys Cys Asn Met Thr Asp Ile Met Ala Ser Leu Gly Leu Val Gln Leu
260 265 270

Asp Arg Tyr Pro Ser Leu Leu Gln Arg Arg Lys Asp Ile Val Asp Arg
275 280 285

Tyr Asp Ser Gly Phe Ala Gly Ser Arg Ile His Pro Leu Ala His Lys
290 295 300

Thr Glu Thr Val Glu Ser Ser Arg His Leu Tyr Ile Thr Arg Val Glu
305 310 315 320

Gly Ala Ser Leu Glu Glu Arg Ser Leu Ile Ile Gln Glu Leu Ala Lys
325 330 335

Ala Gly Ile Ala Ser Asn Val His Tyr Lys Pro Leu Pro Leu Leu Thr
340 345 350

Ala Tyr Lys Asn Leu Gly Phe Asp Met Thr Asn Tyr Pro Lys Ala Tyr
355 360 365

Ala Phe Phe Glu Asn Glu Ile Thr Leu Pro Leu His Thr Lys Leu Ser
370 375 380

Asp Glu Glu Val Asp Tyr Ile Ile Glu Thr Phe Lys Thr Val Ser Glu
385 390 395 400

Lys Val Leu Thr Leu Ser Lys Lys 
405 



<210> 207 
<211> 325 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 207 

Met Thr Glu Pro Asp Phe Trp Asn Asp Asn Ile Ala Ala Gln Lys Thr
1 5 10 15

Ser Gln Glu Leu Asn Val Phe Lys Asn Thr Tyr Asn Thr Phe His Lys
20 25 30

Met Glu Glu Leu Gln Asp Glu Val Glu Ile Leu Leu Asp Phe Leu Ala
35 40 45

Glu Asp Glu Ser Val His Asp Glu Leu Val Ala Gln Leu Ala Glu Leu
50 55 60

Asp Lys Ile Met Thr Ser Tyr Glu Met Thr Leu Leu Leu Ser Glu Pro
65 70 75 80

Tyr Asp His Asn Asn Ala Ile Leu Glu Ile His Pro Gly Ser Gly Gly
85 90 95

Thr Glu Ala Gln Asp Trp Gly Asp Met Leu Leu Arg Met Tyr Thr Arg
100 105 110

Tyr Gly Asn Ala Lys Gly Phe Lys Val Glu Val Leu Asp Tyr Gln Ala
115 120 125

Gly Asp Glu Ala Gly Ile Lys Ser Val Thr Leu Ser Phe Glu Gly Pro
130 135 140

Asn Ala Tyr Gly Leu Leu Lys Ser Glu Met Gly Val His Arg Leu Val
145 150 155 160

Arg Ile Ser Pro Phe Asp Ser Ala Lys Arg Arg His Thr Ser Phe Thr
165 170 175

Ser Val Glu Val Met Pro Glu Leu Asp Asp Thr Ile Glu Val Glu Ile
180 185 190

Arg Glu Asp Asp Ile Lys Met Asp Thr Phe Arg Ser Gly Gly Ala Gly
195 200 205

Gly Gln Asn Val Asn Lys Val Ser Thr Gly Val Arg Leu Thr His Ile
210 215 220

Pro Thr Gly Ile Val Val Gln Ser Thr Val Asp Arg Thr Gln Tyr Gly
225 230 235 240

Asn Arg Asp Arg Ala Met Lys Met Leu Gln Ala Lys Leu Tyr Gln Met
245 250 255

Glu Gln Glu Lys Lys Ala Ala Glu Val Asp Ser Leu Lys Gly Glu Lys
260 265 270

Lys Glu Ile Thr Trp Gly Ser Gln Ile Arg Ser Tyr Val Phe Thr Pro
275 280 285

Tyr Thr Met Val Lys Asp His Arg Thr Ser Phe Glu Val Ala Gln Val
290 295 300



Asp Lys Val Met Asp Gly Asp Leu Asp Gly Phe Ile Asp Ala Tyr Leu
305 310 315 320

Lys Trp Arg Ile Ser
325



<210> 208 

<211> 249 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 208 

Met Phe Tyr Thr Tyr Leu Arg Gly Leu Val Val Leu Leu Leu Trp Ser 
1 5 10 15 

Ile Asn Gly Asn Ala His Tyr His Asn Thr Asp Lys Ile Pro Asn Gln
20 25 30

Asp Glu Asn Tyr Ile Leu Val Ala Pro His Arg Thr Trp Trp Asp Pro
35 40 45

Val Tyr Met Ala Phe Ala Thr Lys Pro Lys Gln Phe Ile Phe Met Ala
50 55 60

Lys Lys Glu Leu Phe Thr Asn Arg Ile Phe Gly Trp Trp Ile Arg Met
65 70 75 80

Cys Gly Ala Phe Pro Ile Asp Arg Glu Asn Pro Ser Ala Ser Ala Ile
85 90 95

Lys Tyr Pro Ile Asn Val Leu Lys Lys Ser Asp Arg Ser Leu Ile Met
100 105 110

Phe Pro Ser Gly Ser Arg His Ser Asn Asp Val Lys Gly Gly Ala Ala
115 120 125

Leu Ile Ala Lys Met Ala Lys Val Arg Ile Met Pro Val Thr Tyr Thr
130 135 140

Gly Pro Met Thr Leu Lys Gly Leu Ile Ser Arg Glu Arg Val Asp Met
145 150 155 160

Asn Phe Gly Asn Pro Ile Asp Ile Ser Asp Ile Lys Lys Met Asn Asp
165 170 175

Glu Gly Ile Glu Thr Val Ala Asn Arg Ile Gln Thr Glu Phe Gln Arg
180 185 190

Leu Asp Glu Glu Thr Lys Gln Trp His Asn Asp Lys Lys Pro Asn Pro
195 200 205

Leu Trp Trp Phe Ile Arg Ile Pro Ala Leu Ile Leu Ala Ile Ile Leu
210 215 220

Ala Ile Leu Thr Ile Ile Phe Ser Phe Ile Ala Ser Phe Ile Trp Asn
225 230 235 240

Pro Asp Lys Lys Arg Glu Glu Leu Ala
245



<210> 209 
<211> 1033 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 209 

Met Ile Ala Gln Leu Asp Thr Lys Thr Val Tyr Ser Phe Met Glu Ser
1 5 10 15

Val Ile Ser Ile Glu Lys Tyr Val Arg Ala Ala Lys Glu Tyr Gly Tyr
20 25 30

Thr His Leu Ala Met Met Asp Ile Asp Asn Leu Tyr Gly Ala Phe Asp
35 40 45

Phe Leu Glu Ile Thr Lys Lys Tyr Gly Ile His Pro Leu Leu Gly Leu
50 55 60

Glu Met Thr Val Phe Val Asp Asp Gln Gly Val Asn Leu Arg Phe Leu
65 70 75 80

Ala Leu Ser Ser Val Gly Tyr Gln Gln Leu Met Lys Leu Ser Thr Ala
85 90 95

Lys Met Gln Gly Glu Lys Thr Trp Ser Val Leu Ser Gln Tyr Leu Glu
100 105 110

Asp Ile Ala Val Ile Val Pro Tyr Phe Asp Arg Val Glu Ser Leu Glu
115 120 125

Leu Gly Cys Asp Tyr Tyr Ile Gly Val Tyr Pro Glu Thr Leu Ala Ser
130 135 140

Glu Phe His His Pro Ile Leu Pro Leu Tyr Arg Val Asn Ala Phe Glu
145 150 155 160

Ser Arg Asp Arg Glu Val Leu Gln Val Leu Thr Ala Ile Lys Glu Asn
165 170 175

Leu Pro Leu Arg Glu Val Pro Leu Arg Ser Arg Gln Asp Val Phe Ile
180 185 190

Ser Ala Ser Ser Leu Glu Lys Leu Phe Gln Glu Arg Phe Pro Gln Ala
195 200 205

Leu Asp Asn Leu Glu Lys Leu Ile Ser Gly Ile Ser Tyr Asp Leu Asp
210 215 220

Thr Ser Leu Lys Leu Pro Arg Phe Asn Pro Ala Arg Pro Ala Val Glu
225 230 235 240

Glu Leu Arg Glu Arg Ala Glu Leu Gly Leu Val Gln Lys Gly Leu Thr
245 250 255

Ser Lys Glu Tyr Gln Asp Arg Leu Asp Gln Glu Leu Ser Val Ile His
260 265 270

Asp Met Gly Phe Asp Asp Tyr Phe Leu Val Val Trp Asp Leu Leu Arg
275 280 285

Phe Gly Arg Ser Asn Gly Tyr Tyr Met Gly Met Gly Arg Gly Ser Ala
290 295 300

Val Gly Ser Leu Val Ser Tyr Ala Leu Asp Ile Thr Gly Ile Asp Pro
305 310 315 320

Val Glu Lys Asn Leu Ile Phe Glu Arg Phe Leu Asn Arg Glu Arg Tyr
325 330 335

Thr Met Pro Asp Ile Asp Ile Asp Ile Pro Asp Ile Tyr Arg Pro Asp
340 345 350

Phe Ile Arg Tyr Val Gly Asn Lys Tyr Gly Ser Lys His Ala Ala Gln
355 360 365

Ile Val Thr Phe Ser Thr Phe Gly Ala Lys Gln Ala Leu Arg Asp Val
370 375 380

Leu Lys Arg Phe Gly Val Pro Glu Tyr Glu Leu Ser Ala Ile Thr Lys
385 390 395 400

Lys Ile Ser Phe Arg Asp Asn Leu Lys Ser Ala Tyr Glu Gly Asn Leu
405 410 415

Gln Phe Arg Gln Gln Ile Asn Ser Lys Leu Glu Tyr Gln Lys Ala Phe
420 425 430

Glu Ile Ala Cys Lys Ile Glu Gly Tyr Pro Arg Gln Thr Ser Val His
435 440 445

Ala Ala Gly Val Val Ile Ser Asp Gln Asp Leu Thr Asn Tyr Ile Pro
450 455 460

Leu Lys Tyr Gly Asp Glu Ile Pro Leu Thr Gln Tyr Asp Ala His Gly
465 470 475 480

Val Glu Ala Ser Gly Leu Leu Lys Met Asp Phe Leu Gly Leu Arg Asn
485 490 495

Leu Thr Phe Val Gln Lys Met Gln Glu Leu Leu Ala Glu Ile Glu Gly
500 505 510

Ile His Leu Lys Ile Glu Glu Ile Asp Leu Glu Asp Lys Glu Thr Leu
515 520 525

Asp Leu Phe Ala Ser Gly Asn Thr Lys Gly Ile Phe Gln Phe Glu Gln
530 535 540

Pro Gly Ala Ile Arg Leu Leu Lys Arg Val Gln Pro Val Cys Phe Glu
545 550 555 560

Asp Val Val Ala Thr Thr Ser Leu Asn Arg Pro Gly Ala Ser Asp Tyr
565 570 575

Ile Asn Asn Phe Val Ala Arg Lys His Gly Gln Glu Glu Val Thr Val
580 585 590

Leu Asp Pro Val Leu Glu Asp Ile Leu Ala Pro Thr Tyr Gly Ile Met
595 600 605

Leu Tyr Gln Glu Gln Val Met Gln Val Ala Gln Arg Phe Ala Gly Phe
610 615 620

Ser Leu Gly Lys Ala Asp Ile Leu Arg Arg Ala Met Gly Lys Lys Asp
625 630 635 640

Ala Ser Ala Met His Glu Met Arg Ala Ser Phe Ile Gln Gly Ser Leu
645 650 655

Glu Ala Gly His Thr Val Glu Lys Ala Glu Gln Val Phe Asp Val Met
660 665 670

Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser His Ala Tyr Ala Tyr
675 680 685

Ser Ala Leu Ala Phe Gln Leu Ala Tyr Phe Lys Thr His Tyr Pro Ala
690 695 700

Ile Phe Tyr Gln Ile Met Leu Asn Ser Ala Asn Ser Asp Tyr Leu Ile
705 710 715 720

Asp Ala Leu Glu Ala Gly Phe Glu Val Ala Pro Leu Ser Ile Asn Thr
725 730 735

Ile Pro Tyr His Asp Lys Ile Ala Asn Lys Ala Ile Tyr Leu Gly Leu
740 745 750

Lys Ser Ile Lys Gly Val Ser Asn Asp Leu Ala Leu Trp Ile Ile Glu
755 760 765

His Arg Pro Tyr Ser Asn Ile Glu Asp Phe Ile Ala Lys Leu Pro Glu
770 775 780

Asn Tyr Leu Lys Leu Pro Leu Leu Glu Pro Leu Val Lys Val Gly Leu
785 790 795 800

Phe Asp Ser Phe Glu Lys Asn Arg Gln Lys Val Phe Asn Asn Leu Ala
805 810 815

Asn Leu Phe Glu Phe Val Lys Glu Leu Gly Ser Leu Phe Gly Asp Ala
820 825 830

Ile Tyr Ser Trp Gln Glu Ser Glu Asp Trp Thr Glu Gln Glu Lys Phe
835 840 845

Tyr Met Glu Gln Glu Leu Leu Gly Ile Gly Val Ser Lys His Pro Leu
850 855 860

Gln Ala Ile Ala Ser Lys Ala Ile Tyr Pro Ile Thr Pro Ile Gly Asn
865 870 875 880

Leu Ser Glu Asn Ser Tyr Ala Ile Ile Leu Val Glu Val Gln Lys Ile
885 890 895

Lys Val Ile Arg Thr Lys Lys Gly Glu Asn Met Ala Phe Leu Gln Ala
900 905 910

Asp Asp Ser Lys Lys Lys Leu Asp Val Thr Leu Phe Ser Asp Leu Tyr
915 920 925

Arg Gln Val Gly Gln Glu Ile Lys Glu Gly Ala Phe Tyr Tyr Val Lys
930 935 940

Gly Lys Ile Gln Ser Arg Asp Gly Arg Leu Gln Met Ile Ala Gln Glu
945 950 955 960

Ile Arg Glu Ala Val Ala Glu Arg Phe Trp Ile Gln Val Lys Asn His
965 970 975

Glu Ser Asp Gln Glu Ile Ser Arg Ile Leu Glu Gln Phe Lys Gly Pro
980 985 990

Ile Pro Val Ile Ile Arg Tyr Glu Glu Glu Gln Lys Thr Ile Val Ser
995 1000 1005

Pro His His Phe Val Ala Lys Ser Asn Glu Leu Glu Glu Lys Leu Asn
1010 1015 1020

Glu Ile Val Met Lys Thr Ile Tyr Arg
1025 1030



<210> 210 
<211> 306 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 210 

Met Thr Asn Glu Phe Leu His Phe Glu Lys Ile Ser Arg Gln Thr Trp
1 5 10 15

Gln Ser Leu His Arg Lys Thr Thr Pro Pro Leu Thr Glu Glu Glu Leu
20 25 30

Glu Ser Ile Lys Ser Phe Asn Asp Gln Ile Ser Leu Gln Asp Val Thr
35 40 45

Asp Ile Tyr Leu Pro Leu Ala His Leu Ile Gln Ile Tyr Lys Arg Thr
50 55 60

Lys Glu Asp Leu Ala Phe Ser Lys Gly Ile Phe Leu Gln Arg Glu Ser
65 70 75 80

Lys Ser Gln Pro Phe Ile Ile Gly Val Ser Gly Ser Val Ala Val Gly
85 90 95

Lys Ser Thr Thr Ser Arg Leu Leu Gln Ile Leu Leu Ser Arg Thr Phe
100 105 110

Thr Asp Ala Thr Val Glu Leu Val Thr Thr Asp Gly Phe Leu Tyr Pro
115 120 125

Asn Gln Thr Leu Ile Glu Gln Gly Ile Leu Asn Arg Lys Gly Phe Pro
130 135 140

Glu Ser Tyr Asp Met Glu Ala Leu Leu Asn Phe Leu Asp Arg Ile Lys
145 150 155 160

Asn Gly Gln Asp Val Asp Ile Pro Val Tyr Ser His Glu Val Tyr Asp
165 170 175

Ile Val Pro Lys Lys Lys Gln Ser Val Lys Ala Ala Asp Phe Val Ile
180 185 190

Val Glu Gly Ile Asn Val Phe Gln Asn Pro Gln Asn Asp Arg Leu Tyr
195 200 205

Ile Thr Asp Phe Phe Asp Phe Ser Ile Tyr Val Asp Ala Gly Val Asp
210 215 220

Asp Ile Glu Ser Trp Tyr Leu Asp Arg Phe Leu Lys Met Leu Ser Leu
225 230 235 240

Ala Gln Asn Asp Pro Asp Ser Tyr Tyr Tyr Arg Phe Thr Gln Met Pro
245 250 255

Ile Gly Glu Val Glu Ser Phe Ala His Gln Val Trp Thr Ser Ile Asn
260 265 270

Leu Thr Asn Leu Gln Asn Tyr Ile Glu Pro Thr Arg Asn Arg Ala Glu
275 280 285

Val Ile Leu His Lys Ser Lys Asn His Glu Ile Asp Glu Ile Tyr Leu
290 295 300

Lys Lys
305

<210> 211 
<211> 246 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 211 

Met Glu Ile Ser Leu Leu Thr Asp Val Gly Gln Lys Arg Thr Asn Asn
1 5 10 15

Gln Asp Tyr Val Asn His Tyr Val Asn Arg Ala Gly Arg Thr Met Ile
20 25 30

Ile Leu Ala Asp Gly Met Gly Gly His Arg Ala Gly Asn Ile Ala Ser
35 40 45

Glu Met Ala Val Thr Asp Leu Gly Val Ala Trp Val Asp Thr Gln Ile
50 55 60

Asp Thr Val Asn Glu Val Arg Glu Trp Phe Ala His Tyr Leu Glu Ile
65 70 75 80

Glu Asn Gln Lys Ile His Gln Leu Gly Gln Asp Glu Ala Tyr Arg Gly
85 90 95

Met Gly Thr Thr Leu Glu Val Leu Ala Ile Ile Asp Asn Gln Ala Ile
100 105 110

Tyr Ala His Ile Gly Asp Ser Arg Ile Gly Leu Ile Arg Gly Glu Glu
115 120 125

Tyr His Gln Leu Thr Ser Asp His Ser Leu Val Asn Glu Leu Leu Lys
130 135 140

Ala Gly Gln Leu Thr Pro Glu Glu Ala Glu Ala His Pro Gln Lys Asn
145 150 155 160

Ile Ile Thr Gln Ser Ile Gly Gln Lys Asp Glu Ile Gln Pro Asp Phe
165 170 175

Gly Thr Val Ile Leu Glu Ser Gly Asp Tyr Leu Leu Leu Asn Ser Asp
180 185 190

Gly Leu Thr Asn Met Ile Ser Gly Ser Glu Ile Arg Asp Ile Val Thr
195 200 205

Ser Asp Ile Pro Leu Ala Asp Lys Thr Glu Thr Leu Val Arg Phe Ala
210 215 220

Asn Asn Ala Gly Gly Leu Asp Asn Ile Thr Val Ala Leu Val Ser Met
225 230 235 240

Asn Glu Glu Asp Glu Glu
245



<210> 212 
<211> 276
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 212 

Met Thr Ile Gln Met Lys Asn Thr Gly Lys Arg Ile Asp Leu Ile Ala
1 5 10 15

Asn Arg Lys Pro Gln Ser Gln Arg Val Leu Tyr Glu Leu Arg Asp Arg
20 25 30

Leu Lys Arg Asn Gln Phe Ile Leu Asn Asp Thr Asn Pro Asp Ile Val
35 40 45

Ile Ser Ile Gly Gly Asp Gly Met Leu Leu Ser Ala Phe His Lys Tyr
50 55 60

Glu Asn Gln Leu Asp Lys Val Arg Phe Ile Gly Leu His Thr Gly His
65 70 75 80

Leu Gly Phe Tyr Thr Asp Tyr Arg Asp Phe Glu Leu Asp Lys Leu Val
85 90 95

Thr Asn Leu Gln Leu Asp Thr Gly Ala Arg Val Ser Tyr Pro Val Leu
100 105 110

Asn Val Lys Val Phe Leu Glu Asn Gly Glu Val Lys Ile Phe Arg Ala
115 120 125

Leu Asn Glu Ala Ser Ile Arg Arg Ser Asp Arg Thr Met Val Ala Asp
130 135 140

Ile Val Ile Asn Gly Val Pro Phe Glu Arg Phe Arg Gly Asp Gly Leu
145 150 155 160

Thr Val Ser Thr Pro Thr Gly Ser Thr Ala Tyr Asn Lys Ser Leu Gly
165 170 175

Gly Ala Val Leu His Pro Thr Ile Glu Ala Leu Gln Leu Thr Glu Ile
180 185 190

Ala Ser Leu Asn Asn Arg Val Tyr Arg Thr Leu Gly Ser Ser Ile Ile
195 200 205

Val Pro Lys Lys Asp Lys Ile Glu Leu Ile Pro Thr Arg Asn Asp Tyr
210 215 220

His Thr Ile Ser Val Asp Asn Ser Val Tyr Ser Phe Arg Asn Ile Glu
225 230 235 240

Arg Ile Glu Tyr Gln Ile Asp His His Lys Ile His Phe Val Ala Thr
245 250 255

Pro Ser His Thr Ser Phe Trp Asn Arg Val Lys Asp Ala Phe Ile Gly
260 265 270

Glu Val Asp Glu 
275 



<210> 213 
<211> 540 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 213 

Met Ser Lys Glu Ile Lys Phe Ser Ser Asp Ala Arg Ser Ala Met Val
1 5 10 15

Arg Gly Val Asp Ile Leu Ala Asp Thr Val Lys Val Thr Leu Gly Pro
20 25 30

Lys Gly Arg Asn Val Val Leu Glu Lys Ser Phe Gly Ser Pro Leu Ile
35 40 45

Thr Asn Asp Gly Val Thr Ile Ala Lys Glu Ile Glu Leu Glu Asp His
50 55 60

Phe Glu Asn Met Gly Ala Lys Leu Val Ser Glu Val Ala Ser Lys Thr
65 70 75 80

Asn Asp Ile Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Thr Gln
85 90 95

Ala Ile Val Arg Glu Gly Ile Lys Asn Val Thr Ala Gly Ala Asn Pro
100 105 110

Ile Gly Ile Arg Arg Gly Ile Glu Thr Ala Val Ala Ala Ala Val Glu
115 120 125

Ala Leu Lys Asn Asn Ala Ile Pro Val Ala Asn Lys Glu Ala Ile Ala
130 135 140

Gln Val Ala Ala Val Ser Ser Arg Ser Glu Lys Val Gly Glu Tyr Ile
145 150 155 160

Ser Glu Ala Met Glu Lys Val Gly Lys Asp Gly Val Ile Thr Ile Glu
165 170 175

Glu Ser Arg Gly Met Glu Thr Glu Leu Glu Val Val Glu Gly Met Gln
180 185 190

Phe Asp Arg Gly Tyr Leu Ser Gln Tyr Met Val Thr Asp Ser Glu Lys
195 200 205

Met Val Ala Asp Leu Glu Asn Pro Tyr Ile Leu Ile Thr Asp Lys Lys
210 215 220

Ile Ser Asn Ile Gln Glu Ile Leu Pro Leu Leu Glu Ser Ile Leu Gln
225 230 235 240

Ser Asn Arg Pro Leu Leu Ile Ile Ala Asp Asp Val Asp Gly Glu Ala
245 250 255

Leu Pro Thr Leu Val Leu Asn Lys Ile Arg Gly Thr Phe Asn Val Val
260 265 270

Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Glu
275 280 285

Asp Ile Ala Ile Leu Thr Gly Gly Thr Val Ile Thr Glu Asp Leu Gly
290 295 300

Leu Glu Leu Lys Asp Ala Thr Ile Glu Ala Leu Gly Gln Ala Ala Arg
305 310 315 320



Val Thr Val Asp Lys Asp Ser Thr Val Ile Val Glu Gly Ala Gly Asn
325 330 335

Pro Glu Ala Ile Ser His Arg Val Ala Val Ile Lys Ser Gln Ile Glu
340 345 350

Thr Thr Thr Ser Glu Phe Asp Arg Glu Lys Leu Gln Glu Arg Leu Ala
355 360 365

Lys Leu Ser Gly Gly Val Ala Val Ile Lys Val Gly Ala Ala Thr Glu
370 375 380

Thr Glu Leu Lys Glu Met Lys Leu Arg Ile Glu Asp Ala Leu Asn Ala
385 390 395 400

Thr Arg Ala Ala Val Glu Glu Gly Ile Val Ala Gly Gly Gly Thr Ala
405 410 415

Leu Ala Asn Val Ile Pro Ala Val Ala Thr Leu Glu Leu Thr Gly Asp
420 425 430

Glu Ala Thr Gly Arg Asn Ile Val Leu Arg Ala Leu Glu Glu Pro Val
435 440 445

Arg Gln Ile Ala His Asn Ala Gly Phe Glu Gly Ser Ile Val Ile Asp
450 455 460

Arg Leu Lys Asn Ala Glu Leu Gly Ile Gly Phe Asn Ala Ala Thr Gly
465 470 475 480

Glu Trp Val Asn Met Ile Asp Gln Gly Ile Ile Asp Pro Val Lys Val
485 490 495

Ser Arg Ser Ala Leu Gln Asn Ala Ala Ser Val Ala Ser Leu Ile Leu
500 505 510

Thr Thr Glu Ala Val Val Ala Asn Lys Pro Glu Pro Val Ala Pro Ala
515 520 525

Pro Ala Met Asp Pro Ser Met Met Gly Gly Met Met
530 535 540



<210> 214 
<211> 481 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 214 

Met Ile Lys Ile Glu Thr Val Leu Asp Ile Leu Lys Lys Asp Gly Leu
1 5 10 15

Phe Arg Glu Ile Ile Asp Gln Gly His Tyr His Tyr Asn Tyr Ser Lys
20 25 30

Val Ile Phe Asp Ser Ile Ser Tyr Asp Ser Arg Lys Val Thr Glu Asp
35 40 45

Thr Leu Phe Phe Ala Lys Gly Ala Ala Phe Lys Lys Glu Tyr Leu Leu
50 55 60

Ser Ala Ile Thr Gln Gly Leu Ala Trp Tyr Val Ala Glu Lys Asp Tyr
65 70 75 80

Glu Val Gly Ile Pro Val Ile Ile Val Asn Asp Ile Lys Lys Ala Met
85 90 95

Ser Leu Ile Ala Met Glu Phe Tyr Gly Asn Pro Gln Glu Lys Leu Lys
100 105 110

Leu Leu Ala Phe Thr Gly Thr Lys Gly Lys Thr Thr Ala Ala Tyr Phe
115 120 125

Ala Tyr Asn Ile Leu Ser Gln Gly His Arg Pro Ala Met Leu Ser Thr
130 135 140

Met Asn Thr Thr Leu Asp Gly Glu Thr Phe Phe Lys Ser Ala Leu Thr
145 150 155 160

Thr Pro Glu Ser Ile Asp Leu Phe Asp Met Met Asn Gln Ala Val Gln
165 170 175

Asn Asp Arg Thr His Leu Ile Met Glu Val Ser Ser Gln Ala Tyr Leu
180 185 190

Val Lys Arg Val Tyr Gly Leu Thr Phe Asp Val Gly Val Phe Leu Asn
195 200 205



Ile Ser Pro Asp His Ile Gly Pro Ile Glu His Pro Ser Phe Glu Asp
210 215 220

Tyr Phe Tyr His Lys Arg Leu Leu Met Glu Lys Ser Arg Ala Val Ile
225 230 235 240

Ile Asn Ser Asp Met Asp His Phe Ser Val Leu Lys Glu Gln Val Glu
245 250 255

Asp Gln Asp His Asp Phe Tyr Gly Ser Gln Phe Asp Asn Gln Ile Glu
260 265 270

Asn Ser Lys Ala Phe Ser Phe Ser Ala Thr Gly Lys Leu Ala Gly Asp
275 280 285

Tyr Asp Ile Gln Leu Ile Gly Asn Phe Asn Gln Glu Asn Ala Val Ala
290 295 300

Ala Gly Leu Ala Cys Leu Arg Leu Gly Ala Ser Leu Glu Asp Ile Lys
305 310 315 320

Lys Gly Ile Ala Ala Thr Arg Val Pro Gly Arg Met Glu Val Leu Thr
325 330 335

Gln Lys Asn Gly Ala Lys Val Phe Ile Asp Tyr Ala His Asn Gly Asp
340 345 350

Ser Leu Lys Lys Leu Ile Asn Val Val Glu Thr His Gln Thr Gly Lys
355 360 365

Ile Ala Leu Val Leu Gly Ser Thr Gly Asn Lys Gly Glu Ser Arg Arg
370 375 380

Lys Asp Phe Gly Leu Leu Leu Asn Gln His Pro Glu Ile Gln Val Phe
385 390 395 400

Leu Thr Ala Asp Asp Pro Asn Tyr Glu Asp Pro Met Ala Ile Ala Asp
405 410 415

Glu Ile Ser Ser Tyr Ile Asn His Pro Val Glu Lys Ile Ala Asp Arg
420 425 430

Gln Glu Ala Ile Lys Ala Ala Met Ala Ile Thr Asn His Glu Leu Asp
435 440 445

Ala Val Ile Ile Ala Gly Lys Gly Ala Asp Cys Tyr Gln Ile Ile Gln
450 455 460

Gly Lys Lys Glu Ser Tyr Pro Gly Asp Thr Ala Val Ala Glu Asn Tyr
465 470 475 480

Leu



<210> 215 
<211> 659
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 215 

Met Ile Gln Ile Gly Lys Ile Phe Ala Gly Arg Tyr Arg Ile Val Lys
1 5 10 15

Gln Ile Gly Arg Gly Gly Met Ala Asp Val Tyr Leu Ala Lys Asp Leu
20 25 30

Ile Leu Asp Gly Glu Glu Val Ala Val Lys Val Leu Arg Thr Asn Tyr
35 40 45

Gln Thr Asp Pro Ile Ala Val Ala Arg Phe Gln Arg Glu Ala Arg Ala
50 55 60

Met Ala Asp Leu Asp His Pro His Ile Val Arg Ile Thr Asp Ile Gly
65 70 75 80

Glu Glu Asp Gly Gln Gln Tyr Leu Ala Met Glu Tyr Val Ala Gly Leu
85 90 95

Asp Leu Lys Arg Tyr Ile Lys Glu His Tyr Pro Leu Ser Asn Glu Glu
100 105 110

Ala Val Arg Ile Met Gly Gln Ile Leu Leu Ala Met Arg Leu Ala His
115 120 125



WO 01/49721

PCT/US00/35604



Thr Arg Gly Ile Val His Arg Asp Leu Lys Pro Gln Asn Ile Leu Leu
130 135 140

Thr Pro Asp Gly Thr Ala Lys Val Thr Asp Phe Gly Ile Ala Val Ala
145 150 155 160

Phe Ala Glu Thr Ser Leu Thr Gln Thr Asn Ser Met Leu Gly Ser Val
165 170 175

His Tyr Leu Ser Pro Glu Gln Ala Arg Gly Ser Lys Ala Thr Val Gln
180 185 190

Ser Asp Ile Tyr Ala Met Gly Ile Ile Phe Tyr Glu Met Leu Thr Gly
195 200 205

His Ile Pro Tyr Asp Gly Asp Ser Ala Val Thr Ile Ala Leu Gln His
210 215 220

Phe Gln Lys Pro Leu Pro Ser Val Ile Ala Glu Asn Pro Ser Val Pro
225 230 235 240

Gln Ala Leu Glu Asn Val Ile Ile Lys Ala Thr Ala Lys Lys Leu Thr
245 250 255

Asn Arg Tyr Arg Ser Val Ser Glu Met Tyr Val Asp Leu Ser Ser Ser
260 265 270

Leu Ser Tyr Asn Arg Arg Asn Glu Ser Lys Leu Ile Phe Asp Glu Thr
275 280 285

Ser Lys Ala Asp Thr Lys Thr Leu Pro Lys Val Ser Gln Ser Thr Leu
290 295 300

Thr Ser Ile Pro Lys Val Gln Ala Gln Thr Glu His Lys Ser Ile Lys
305 310 315 320

Asn Pro Ser Gln Ala Val Thr Glu Glu Thr Tyr Gln Pro Gln Ala Pro
325 330 335

Lys Lys His Arg Phe Lys Met Arg Tyr Leu Ile Leu Leu Ala Ser Leu
340 345 350

Val Leu Val Ala Ala Ser Leu Ile Trp Ile Leu Ser Arg Thr Pro Ala
355 360 365

Thr Ile Ala Ile Pro Asp Val Ala Gly Gln Thr Val Ala Glu Ala Lys
370 375 380

Ala Thr Leu Lys Lys Ala Asn Phe Glu Ile Gly Glu Glu Lys Thr Glu
385 390 395 400

Ala Ser Glu Lys Val Glu Glu Gly Arg Ile Ile Arg Thr Asp Pro Gly
405 410 415

Ala Gly Thr Gly Arg Lys Glu Gly Thr Lys Ile Asn Leu Val Val Ser
420 425 430

Ser Gly Lys Gln Ser Phe Gln Ile Ser Asn Tyr Val Gly Arg Lys Ser
435 440 445

Ser Asp Val Ile Ala Glu Leu Lys Glu Lys Lys Val Pro Asp Asn Leu
450 455 460

Ile Lys Ile Glu Glu Glu Glu Ser Asn Glu Ser Glu Ala Gly Thr Val
465 470 475 480

Leu Lys Gln Ser Leu Pro Glu Gly Thr Thr Tyr Asp Leu Ser Lys Ala
485 490 495

Thr Gln Ile Val Leu Thr Val Ala Lys Lys Ala Thr Thr Ile Gln Leu
500 505 510

Gly Asn Tyr Ile Gly Arg Asn Ser Thr Glu Val Ile Ser Glu Leu Lys
515 520 525

Gln Lys Lys Val Pro Glu Asn Leu Ile Lys Ile Glu Glu Glu Glu Ser
530 535 540

Ser Glu Ser Glu Pro Gly Thr Ile Met Lys Gln Ser Pro Gly Ala Gly
545 550 555 560

Thr Thr Tyr Asp Val Ser Lys Pro Thr Gln Ile Val Leu Thr Val Ala
565 570 575

Lys Lys Val Thr Ser Val Ala Met Pro Ser Tyr Ile Gly Ser Ser Leu
580 585 590

Glu Phe Thr Lys Asn Asn Leu Ile Gln Ile Val Gly Ile Lys Glu Ala
595 600 605

Asn Ile Glu Val Val Glu Val Thr Thr Ala Pro Ala Gly Ser Ala Glu
610 615 620

Gly Met Val Val Glu Gln Ser Pro Arg Ala Gly Glu Lys Val Asp Leu
625 630 635 640

Asn Lys Thr Arg Val Lys Ile Ser Ile Tyr Lys Pro Lys Thr Thr Ser
645 650 655

Ala Thr Pro



<210> 216 
<211> 391 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 216 

Met Lys His Phe Asp Thr Ile Val Ile Gly Gly Gly Pro Ala Gly Met
1 5 10 15

Met Ala Thr Ile Ser Ser Asn Phe Tyr Gly Gln Lys Thr Leu Leu Ile
20 25 30

Glu Lys Asn Arg Lys Leu Gly Lys Lys Leu Ala Gly Thr Gly Gly Gly
35 40 45

Arg Cys Asn Val Thr Asn Asn Gly Ser Leu Asp Asn Leu Leu Ala Gly
50 55 60

Ile Pro Gly Asn Gly Arg Phe Leu Tyr Ser Val Phe Ser Gln Phe Asp
65 70 75 80

Asn His Asp Ile Ile Asn Phe Phe Thr Glu Asn Gly Val Lys Leu Lys
85 90 95

Val Glu Asp His Gly Arg Val Phe Pro Ala Ser Asp Lys Ser Arg Thr
100 105 110

Ile Ile Glu Ala Leu Glu Lys Lys Ile Thr Glu Leu Gly Gly Gln Val
115 120 125

Ala Thr Gln Ile Glu Ile Val Ser Val Lys Lys Val Asp Asp Gln Phe
130 135 140

Val Leu Lys Ser Ala Asp Gln Thr Phe Thr Cys Glu Lys Leu Ile Val
145 150 155 160

Thr Thr Gly Gly Lys Ser Tyr Pro Ser Thr Gly Ser Thr Gly Phe Gly
165 170 175

His Glu Ile Ala Arg His Phe Lys His Thr Ile Thr Asp Leu Glu Ala
180 185 190



Ala Glu Ser Pro Leu Leu Thr Asp Phe Pro His Lys Ala Leu Gln Gly
195 200 205

Ile Ser Leu Asp Asp Val Thr Leu Ser Tyr Gly Lys His Val Ile Thr
210 215 220

His Asp Leu Leu Phe Thr His Phe Gly Leu Ser Gly Pro Ala Ala Leu
225 230 235 240

Arg Met Ser Ser Phe Val Lys Gly Gly Glu Val Leu Ser Leu Asp Val
245 250 255

Leu Pro Gln Leu Ser Glu Lys Asp Leu Val Thr Phe Leu Glu Glu Asn
260 265 270

Arg Glu Lys Ser Leu Lys Asn Ala Leu Lys Thr Leu Leu Pro Glu Arg
275 280 285

Leu Ala Glu Phe Phe Val Gln Gly Tyr Pro Glu Lys Val Lys Gln Leu
290 295 300

Thr Glu Lys Glu Arg Glu Gln Leu Val Gln Ser Ile Lys Glu Leu Lys
305 310 315 320

Ile Pro Val Thr Gly Lys Met Ser Leu Ala Lys Ser Phe Val Thr Lys
325 330 335

Gly Gly Val Ser Leu Lys Glu Ile Asn Pro Lys Thr Leu Glu Ser Lys
340 345 350

Leu Val Pro Gly Leu His Phe Ala Gly Glu Val Met Asp Ile Asn Ala
355 360 365

His Thr Gly Gly Phe Asn Ile Thr Ser Ala Leu Cys Thr Gly Trp Val
370 375 380

Ala Gly Ser Leu His Tyr Asp
385 390



<210> 217
<211> 231
<212> PRT

<213> Streptococcus pneumoniae 
<400> 217 

Met Leu Lys Trp Glu Asp Leu Pro Val Glu Met Lys Ser Ser Glu Val
1 5 10 15

Glu Ser Tyr Tyr Gln Leu Val Ser Lys Arg Lys Gly Ser Leu Ile Phe
20 25 30

Lys Arg Cys Leu Asp Trp Val Leu Ala Leu Val Leu Thr Trp Val Leu
35 40 45

Thr Ser Pro Ile Phe Leu Ile Leu Ser Ile Trp Ile Lys Leu Asp Ser
50 55 60

Lys Gly Pro Val Ile Tyr Lys Gln Glu Arg Val Thr Gln Tyr Asn Arg
65 70 75 80

Arg Phe Lys Ile Trp Lys Phe Arg Thr Met Val Thr Asp Ala Asp Lys
85 90 95

Lys Gly Ser Leu Val Thr Ser Ala Asn Asp Ser Arg Ile Thr Lys Val
100 105 110

Gly Asn Phe Ile Arg Arg Val Arg Leu Asp Glu Leu Pro Gln Leu Val
115 120 125

Asn Val Leu Lys Gly Glu Met Ser Phe Val Gly Thr Arg Pro Glu Val
130 135 140

Pro Arg Tyr Thr Glu Gln Tyr Ser Pro Glu Met Met Ala Thr Leu Leu
145 150 155 160

Leu Gln Ala Gly Ile Thr Ser Pro Ala Ser Ile Asn Tyr Lys Asp Glu
165 170 175

Asp Thr Ile Ile Ser Gln Met Thr Glu Lys Gly Leu Ser Val Asp Gln
180 185 190

Ala Tyr Val Glu His Val Leu Pro Glu Lys Met Arg Tyr Asn Leu Ala
195 200 205

Tyr Leu Arg Glu Phe Ser Phe Phe Gly Asp Ile Lys Ile Met Phe Gln
210 215 220

Thr Val Phe Glu Val Leu Lys
225 230



<210> 218
<211> 140
<212> PRT

<213> Streptococcus pneumoniae
<400> 218

Met Thr Ser Pro Leu Leu Glu Ser Arg Arg Gln Leu Arg Lys Cys Ala
1 5 10 15

Phe Gln Ala Leu Met Ser Leu Glu Phe Gly Thr Asp Val Glu Thr Ala
20 25 30

Cys Arg Phe Ala Tyr Thr His Asp Arg Glu Tyr Thr Asp Val Gln Leu
35 40 45

Pro Ala Phe Leu Ile Asp Leu Val Ser Gly Val Gln Ala Lys Lys Glu
50 55 60

Glu Leu Asp Lys Gln Ile Thr Gln His Leu Lys Ala Gly Trp Thr Ile
65 70 75 80

Glu Arg Leu Thr Leu Val Glu Arg Asn Leu Leu Arg Leu Gly Val Phe
85 90 95

Glu Ile Thr Ser Phe Asp Thr Pro Gln Leu Val Ala Val Asn Glu Ala
100 105 110

Ile Glu Leu Ala Lys Asp Phe Ser Asp Gln Lys Ser Ala Arg Phe Ile
115 120 125

Asn Gly Leu Leu Ser Gln Phe Val Thr Glu Glu Gln
130 135 140



<210> 219
<211> 1179
<212> PRT

<213> Streptococcus pneumoniae
<400> 219

Met Tyr Leu Lys Glu Ile Glu Ile Gln Gly Phe Lys Ser Phe Ala Asp
1 5 10 15

Lys Thr Lys Val Val Phe Asp Gln Gly Val Thr Ala Val Val Gly Pro
20 25 30

Asn Gly Ser Gly Lys Ser Asn Ile Thr Glu Ser Leu Arg Trp Ala Leu
35 40 45

Gly Glu Ser Ser Val Lys Ser Leu Arg Gly Gly Lys Met Pro Asp Val
50 55 60

Ile Phe Ala Gly Thr Glu Ser Arg Lys Pro Leu Asn Tyr Ala Ser Val
65 70 75 80

Val Val Thr Leu Asp Asn His Asp Gly Phe Ile Lys Asp Ala Gly Gln
85 90 95

Glu Ile Arg Val Glu Arg His Ile Tyr Arg Ser Gly Asp Ser Glu Tyr
100 105 110

Lys Ile Asp Gly Lys Lys Val Arg Leu Arg Asp Ile His Asp Leu Phe
115 120 125

Leu Asp Thr Gly Leu Gly Arg Asp Ser Phe Ser Ile Ile Ser Gln Gly
130 135 140

Lys Val Glu Glu Ile Phe Asn Ser Lys Pro Glu Glu Arg Arg Ala Ile
145 150 155 160

Phe Glu Glu Ala Ala Gly Val Leu Lys Tyr Lys Thr Arg Arg Lys Glu
165 170 175

Thr Glu Ser Lys Leu Gln Gln Thr Gln Asp Asn Leu Asp Arg Leu Glu
180 185 190

Asp Ile Ile Tyr Glu Leu Asp Asn Gln Ile Lys Pro Leu Glu Lys Gln
195 200 205

Ala Glu Asn Ala Arg Lys Phe Leu Asp Leu Glu Gly Gln Arg Lys Ala
210 215 220

Ile Tyr Leu Asp Val Leu Val Ala Gln Ile Lys Glu Asn Lys Ala Glu
225 230 235 240

Leu Glu Ser Thr Glu Glu Glu Leu Ala Gln Val Gln Glu Leu Leu Met
245 250 255

Ser Tyr Tyr Gln Lys Arg Glu Lys Leu Glu Glu Glu Asn Gln Thr Leu
260 265 270

Lys Lys Gln Arg Gln Asp Leu Gln Ala Glu Met Ala Lys Asp Gln Gly
275 280 285

Ser Leu Met Asp Leu Thr Ser Leu Ile Ser Asp Leu Glu Arg Lys Leu
290 295 300

Ala Leu Ser Lys Leu Glu Ser Glu Gln Val Ala Leu Asn Gln Gln Glu
305 310 315 320

Ala Gln Ala Arg Leu Ala Ala Leu Glu Asp Lys Arg Asn Ser Leu Ser
325 330 335



Lys Glu Lys Tyr Asp Lys Glu Ser Ser Leu Ala Leu Leu Glu Gly Asn
340 345 350

Leu Val Gln Asn Asn Gln Lys Leu Asn Arg Leu Glu Ala Glu Leu Leu
355 360 365

Ala Phe Ser Asp Asp Pro Asp Gln Met Ile Glu Leu Leu Arg Glu Arg
370 375 380

Phe Val Ala Leu Leu Gln Glu Glu Ala Asp Val Ser Asn Gln Leu Thr
385 390 395 400

Arg Ile Glu Asn Glu Leu Glu Asn Ser Arg Gln Leu Ser Gln Lys Gln
405 410 415

Ala Asp Gln Leu Glu Lys Leu Lys Glu Gln Leu Ala Thr Ala Lys Glu
420 425 430

Lys Ala Ser Gln Gln Lys Asp Glu Leu Glu Thr Ala Lys Val Gln Val
435 440 445

Gln Lys Leu Leu Ala Asp Tyr Gln Ala Ile Ala Lys Glu Gln Glu Glu
450 455 460

Gln Lys Thr Ser Tyr Gln Ala Gln Gln Ser Gln Leu Phe Asp Arg Leu
465 470 475 480

Asp Ser Leu Lys Asn Lys Gln Ala Arg Ala Gln Ser Leu Glu Asn Ile
485 490 495

Leu Arg Asn His Ser Asn Phe Tyr Ala Gly Val Lys Ser Val Leu Gln
500 505 510

Glu Lys Asp Arg Leu Gly Gly Ile Ile Gly Ala Val Ser Glu His Leu
515 520 525

Thr Phe Asp Val Tyr Tyr Gln Thr Ala Leu Glu Ile Ala Leu Gly Ala
530 535 540

Ser Ser Gln His Ile Ile Val Glu Asp Glu Glu Ser Ala Thr Lys Ala
545 550 555 560

Ile Asp Phe Leu Lys Arg Asn Arg Val Gly Arg Ala Thr Phe Leu Pro
565 570 575

Leu Thr Thr Ile Lys Ala Arg Thr Ile Ser Ser Gln Asn Gln Asp Ala
580 585 590

Ile Ala Val Ser Pro Gly Phe Leu Gly Met Ala Asp Glu Leu Val Thr
595 600 605

Phe Asp Thr Arg Leu Glu Ala Ile Phe Lys Asn Leu Leu Ala Thr Thr
610 615 620

Ala Ile Phe Asp Thr Val Glu His Ala Arg Glu Ala Ala Arg Gln Val
625 630 635 640

Arg Tyr Gln Val Arg Met Val Thr Leu Asp Gly Thr Glu Leu Arg Thr
645 650 655

Gly Gly Ser Tyr Ala Gly Gly Ala Asn Arg Gln Asn Asn Ser Ile Phe
660 665 670

Ile Lys Pro Glu Leu Glu Gln Leu Gln Lys Glu Ile Ala Ala Asp Glu
675 680 685

Ala Ser Leu Gly Ser Glu Glu Ala Ala Leu Lys Thr Leu Gln Asp Gln
690 695 700

Met Ala Ala Leu Thr Glu Arg Leu Glu Ala Ile Lys Ser Gln Gly Glu
705 710 715 720

Gln Ala Arg Ile Gln Glu Gln Gly Leu Ser Leu Ala Tyr Gln Gln Thr
725 730 735

Ser Gln Gln Val Glu Glu Leu Glu Thr Leu Trp Lys Leu Gln Glu Glu
740 745 750

Glu Ile Asp Arg Leu Ser Glu Gly Asp Trp Gln Ala Asp Lys Glu Lys
755 760 765

Cys Gln Glu Ser Leu Ala Thr Ile Ala Ser Asp Lys Gln Asn Leu Glu
770 775 780

Ala Glu Ile Glu Glu Ile Lys Ser Asn Lys Asn Ala Ile Gln Glu Arg
785 790 795 800

Tyr Gln Asn Leu Gln Glu Glu Val Ala Gln Ala Arg Leu Leu Lys Thr
805 810 815

Lys Leu Gln Gly Gln Lys Arg Tyr Glu Val Ala Asp Ile Glu Arg Leu
820 825 830

Gly Lys Glu Leu Asp Asn Leu Asn Ile Glu Gln Glu Glu Ile Gln Arg
835 840 845

Met Leu Gln Glu Lys Val Asp Asn Leu Glu Lys Val Asp Thr Glu Leu
850 855 860

Leu Ser Gln Gln Ala Glu Glu Ser Lys Thr Gln Lys Thr Asn Leu Gln
865 870 875 880

Gln Gly Leu Ile Arg Lys Gln Phe Glu Leu Asp Asp Ile Glu Gly Gln
885 890 895



Leu Asp Asp Ile Ala Ser His Leu Asp Gln Ala Arg Gln Gln Asn Glu
900 905 910

Glu Trp Ile Arg Lys Gln Thr Arg Ala Glu Ala Lys Lys Glu Lys Val
915 920 925

Ser Glu Arg Leu Arg His Leu Gln Asn Gln Leu Thr Asp Gln Tyr Gln
930 935 940

Ile Ser Tyr Thr Glu Ala Leu Glu Lys Ala His Glu Leu Glu Asn Leu
945 950 955 960

Asn Leu Ala Glu Gln Glu Val Gln Asp Leu Glu Lys Ala Ile Arg Ser
965 970 975

Leu Gly Pro Val Asn Leu Glu Ala Ile Asp Gln Tyr Glu Glu Val His
980 985 990

Asn Arg Leu Asp Phe Leu Asn Ser Gln Arg Asp Asp Ile Leu Ser Ala
995 1000 1005

Lys Asn Leu Leu Leu Glu Thr Ile Thr Glu Met Asn Asp Glu Val Lys
1010 1015 1020

Glu Arg Phe Lys Ser Thr Phe Glu Ala Ile Arg Glu Ser Phe Lys Val
1025 1030 1035 1040

Thr Phe Lys Gln Met Phe Gly Gly Gly Gln Ala Asp Leu Ile Leu Thr
1045 1050 1055

Glu Gly Asp Leu Leu Thr Ala Gly Val Glu Ile Ser Val Gln Pro Pro
1060 1065 1070

Gly Lys Lys Ile Gln Ser Leu Asn Leu Met Ser Gly Gly Glu Lys Ala
1075 1080 1085

Leu Ser Ala Leu Ala Leu Leu Phe Ser Ile Ile Arg Val Lys Thr Ile
1090 1095 1100

Pro Phe Val Ile Leu Asp Glu Val Glu Ala Ala Leu Asp Glu Ala Asn
1105 1110 1115 1120

Val Lys Arg Phe Gly Asp Tyr Leu Asn Arg Phe Asp Lys Asp Ser Gln
1125 1130 1135

Phe Ile Val Val Thr His Arg Lys Gly Thr Met Ala Ala Ala Asp Ser
1140 1145 1150

Ile Tyr Gly Val Thr Met Gln Glu Ser Gly Val Ser Lys Ile Val Ser
1155 1160 1165

Val Lys Leu Lys Asp Leu Glu Ser Ile Glu Gly
1170 1175

<210> 220 
<211> 447 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 220 

Met Thr Lys Arg Val Thr Ile Ile Asp Val Lys Asp Tyr Val Gly Gln
1 5 10 15

Glu Val Thr Ile Gly Ala Trp Val Ala Asn Lys Ser Gly Lys Gly Lys
20 25 30

Ile Ala Phe Leu Gln Leu Arg Asp Gly Thr Ala Phe Phe Gln Gly Val
35 40 45

Ala Phe Lys Pro Asn Phe Val Glu Lys Phe Gly Glu Glu Val Gly Leu
50 55 60

Glu Lys Phe Asp Val Ile Lys Arg Leu Ser Gln Glu Thr Ser Val Tyr
65 70 75 80

Val Thr Gly Ile Val Lys Glu Asp Glu Arg Ser Lys Phe Gly Tyr Glu
85 90 95

Leu Asp Ile Thr Asp Ile Glu Val Ile Gly Glu Ser Gln Asp Tyr Pro
100 105 110

Ile Thr Pro Lys Glu His Gly Thr Asp Phe Leu Met Asp Asn Arg His
115 120 125

Leu Trp Leu Arg Ser Arg Lys Gln Val Ala Val Leu Gln Ile Arg Asn
130 135 140

Ala Ile Ile Tyr Ala Thr Tyr Glu Phe Phe Asp Lys Asn Gly Phe Met
145 150 155 160

Lys Phe Asp Ser Pro Ile Leu Ser Gly Asn Ala Ala Glu Asp Ser Thr
165 170 175



Glu Leu Phe Glu Thr Asp Tyr Phe Gly Thr Pro Ala Tyr Leu Ser Gln
180 185 190

Ser Gly Gln Leu Tyr Leu Glu Ala Gly Ala Met Ala Leu Gly Arg Val
195 200 205

Phe Asp Phe Gly Pro Val Phe Arg Ala Glu Lys Ser Lys Thr Arg Arg
210 215 220

His Leu Thr Glu Phe Trp Met Met Asp Ala Glu Tyr Ser Tyr Leu Thr
225 230 235 240

His Asp Glu Ser Leu Asp Leu Gln Glu Ala Tyr Val Lys Ala Leu Leu
245 250 255

Gln Gly Val Leu Asp Arg Ala Pro Gln Ala Leu Glu Thr Leu Glu Arg
260 265 270

Asp Thr Glu Leu Leu Lys Arg Tyr Ile Ala Glu Pro Phe Lys Arg Ile
275 280 285

Thr Tyr Asp Gln Ala Ile Asp Leu Leu Gln Glu His Glu Asn Asp Glu
290 295 300

Asp Ala Asp Tyr Glu His Leu Glu His Gly Asp Asp Phe Gly Ser Pro
305 310 315 320

His Glu Thr Trp Ile Ser Asn His Phe Gly Val Pro Thr Phe Val Met
325 330 335

Asn Tyr Pro Ala Ala Ile Lys Ala Phe Tyr Met Lys Pro Val Pro Gly
340 345 350

Asn Pro Glu Arg Val Leu Cys Ala Asp Leu Leu Ala Pro Glu Gly Tyr
355 360 365

Gly Glu Ile Ile Gly Gly Ser Met Arg Glu Glu Asp Tyr Asp Ala Leu
370 375 380

Val Ala Lys Met Asp Glu Leu Gly Met Asp Arg Thr Glu Tyr Glu Phe
385 390 395 400

Tyr Leu Asp Leu Arg Lys Tyr Gly Thr Val Pro His Gly Gly Phe Gly
405 410 415

Ile Gly Ile Glu Arg Met Val Thr Phe Ala Ala Gly Thr Lys His Ile
420 425 430

Arg Glu Ala Ile Pro Phe Pro Arg Met Leu His Arg Ile Lys Pro
435 440 445



<210> 221 
<211> 308 
<212> PRT

<213> Streptococcus pneumoniae 

<400> 221 

Met Ser Glu Lys Leu Val Glu Ile Lys Asp Leu Glu Ile Ser Phe Gly
1 5 10 15

Glu Gly Ser Lys Lys Phe Val Ala Val Lys Asn Ala Asn Phe Phe Ile
20 25 30

Asn Lys Gly Glu Thr Phe Ser Leu Val Gly Glu Ser Gly Ser Gly Lys
35 40 45

Thr Thr Ile Gly Arg Ala Ile Ile Gly Leu Asn Asp Thr Ser Asn Gly
50 55 60

Asp Ile Ile Phe Asp Gly Gln Lys Ile Asn Gly Lys Lys Ser Arg Glu
65 70 75 80

Gln Ala Ala Glu Leu Ile Arg Arg Ile Gln Met Ile Phe Gln Asp Pro
85 90 95

Ala Ala Ser Leu Asn Glu Arg Ala Thr Val Asp Tyr Ile Ile Ser Glu
100 105 110

Gly Leu Tyr Asn His Arg Leu Phe Lys Asp Glu Glu Glu Arg Lys Glu
115 120 125

Lys Val Gln Asn Ile Ile Arg Glu Val Gly Leu Leu Ala Glu His Leu
130 135 140

Thr Arg Tyr Pro His Glu Phe Ser Gly Gly Gln Arg Gln Arg Ile Gly
145 150 155 160

Ile Ala Arg Ala Leu Val Met Gln Pro Asp Phe Val Ile Ala Asp Glu
165 170 175

Pro Ile Ser Ala Leu Asp Val Ser Val Arg Ala Gln Val Leu Asn Leu
180 185 190

Leu Lys Lys Phe Gln Lys Glu Leu Gly Leu Thr Tyr Leu Phe Ile Ala
195 200 205

His Asp Leu Ser Val Val Arg Phe Ile Ser Asp Arg Ile Ala Val Ile
210 215 220

Tyr Lys Gly Val Ile Val Glu Val Ala Glu Thr Glu Glu Leu Phe Asn
225 230 235 240


Asn Pro Ile His Pro Tyr Thr Gln Ala Leu Leu Ser Ala Val Pro Ile
245 250 255

Pro Asp Pro Ile Leu Glu Arg Lys Lys Val Leu Lys Val Tyr Asp Pro
260 265 270

Ser Gln His Asp Tyr Glu Thr Asp Lys Pro Ser Met Val Glu Ile Arg
275 280 285

Pro Gly His Tyr Val Trp Ala Asn Gln Thr Glu Leu Ala Arg Tyr Gln
290 295 300

Lys Gly Leu Asn
305



<210> 222 
<211> 424 
<212> PRT

<213> Streptococcus pneumoniae 

<400> 222 

Met Lys Ile Ser Trp Asn Gly Phe Ser Lys Lys Ser Tyr Gln Glu Arg
1 5 10 15

Leu Glu Leu Leu Lys Ala Gln Ala Leu Leu Ser Pro Glu Arg Gln Ala
20 25 30

Ser Leu Glu Lys Asp Glu Gln Met Ser Val Thr Val Ala Asp Gln Leu
35 40 45

Ser Glu Asn Val Val Gly Thr Phe Ser Leu Pro Tyr Ser Leu Val Pro
50 55 60

Glu Val Leu Val Asn Gly Gln Glu Tyr Thr Val Pro Tyr Val Thr Glu
65 70 75 80

Glu Pro Ser Val Val Ala Ala Ala Ser Tyr Ala Ser Lys Ile Ile Lys
85 90 95

Arg Ala Gly Gly Phe Thr Ala Gln Val His Gln Arg Gln Met Ile Gly
100 105 110



Gln Val Ala Leu Tyr Gln Ile Ala Asn Pro Lys Leu Ala Gln Glu Lys
115 120 125

Ile Ala Ser Lys Lys Ala Glu Leu Leu Glu Leu Ala Asn Gln Ala Tyr
130 135 140

Pro Ser Ile Val Lys Arg Gly Gly Gly Ala Arg Asp Leu His Val Glu
145 150 155 160

Gln Ile Lys Gly Glu Pro Asp Phe Leu Val Val Tyr Ile His Val Asp
165 170 175

Thr Gln Glu Ala Met Gly Ala Asn Met Leu Asn Thr Met Leu Glu Ala
180 185 190

Leu Lys Pro Val Leu Glu Glu Leu Ser Gln Gly Gln Ser Leu Met Gly
195 200 205

Ile Leu Ser Asn Tyr Ala Thr Asp Ser Leu Val Thr Ala Ser Cys Arg
210 215 220

Ile Ala Phe Arg Tyr Leu Ser Arg Gln Lys Asp Gln Gly Arg Glu Ile
225 230 235 240

Ala Glu Lys Ile Ala Leu Ala Ser Gln Phe Ala Gln Ala Asp Pro Tyr
245 250 255

Arg Ala Ala Thr His Asn Lys Gly Ile Phe Asn Gly Ile Asp Ala Ile
260 265 270



Leu Ile Ala Thr Gly Asn Asp Trp Arg Ala Ile Glu Ala Gly Ala His
275 280 285

Ala Phe Ala Ser Arg Asp Gly Arg Tyr Gln Gly Leu Ser Cys Trp Thr
290 295 300

Leu Asp Leu Glu Arg Glu Glu Leu Val Gly Glu Met Thr Leu Pro Met
305 310 315 320

Pro Val Ala Thr Lys Gly Gly Ser Ile Gly Leu Asn Pro Arg Val Ala
325 330 335

Leu Ser His Asp Leu Leu Gly Asn Pro Ser Ala Arg Glu Leu Ala Gln
340 345 350

Ile Ile Val Ser Ile Gly Leu Ala Gln Asn Phe Ala Ala Leu Lys Ala
355 360 365

Leu Val Ser Thr Gly Ile Gln Gln Gly His Met Lys Leu Gln Ala Lys
370 375 380

Ser Leu Ala Leu Leu Ala Gly Ala Ser Glu Ser Glu Val Ala Pro Leu
385 390 395 400

Val Glu Arg Leu Ile Ser Asp Lys Thr Phe Asn Leu Glu Thr Ala Gln
405 410 415

Arg Tyr Leu Glu Asn Leu Arg Ser
420



<210> 223 
<211> 262 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 223 

Met Pro Ile Thr Ser Leu Glu Ile Lys Asp Lys Thr Phe Gly Thr Arg
1 5 10 15

Phe Arg Gly Phe Asp Pro Glu Glu Val Asp Glu Phe Leu Asp Ile Val
20 25 30

Val Arg Asp Tyr Glu Asp Leu Val Arg Ala Asn His Asp Lys Asn Leu
35 40 45

Arg Ile Lys Ser Leu Glu Glu Arg Leu Ser Tyr Phe Asp Glu Ile Lys
50 55 60

Asp Ser Leu Ser Gln Ser Val Leu Ile Ala Gln Asp Thr Ala Glu Arg
65 70 75 80

Val Lys Gln Ala Ala His Glu Arg Ser Asn Asn Ile Ile His Gln Ala
85 90 95

Glu Gln Asp Ala Gln Arg Leu Leu Glu Glu Ala Lys Tyr Lys Ala Asn
100 105 110

Glu Ile Leu Arg Gln Ala Thr Asp Asn Ala Lys Lys Val Ala Val Glu
115 120 125

Thr Glu Glu Leu Lys Asn Lys Ser Arg Val Phe His Gln Arg Leu Lys
130 135 140

Ser Thr Ile Glu Ser Gln Leu Ala Ile Val Glu Ser Ser Asp Trp Glu
145 150 155 160

Asp Ile Leu Arg Pro Thr Ala Thr Tyr Leu Gln Thr Ser Asp Glu Ala
165 170 175

Phe Lys Glu Val Val Ser Glu Val Leu Gly Glu Pro Ile Pro Ala Pro
180 185 190

Ile Glu Glu Glu Pro Ile Asp Met Thr Arg Gln Phe Ser Gln Ala Glu
195 200 205

Met Ala Glu Leu Gln Ala Arg Ile Glu Val Ala Asp Lys Glu Leu Ser
210 215 220

Glu Phe Glu Ala Gln Ile Lys Gln Glu Val Glu Ala Pro Thr Pro Val
225 230 235 240

Val Ser Pro Gln Val Glu Glu Glu Pro Leu Leu Ile Gln Leu Ala Gln
245 250 255

Cys Met Lys Asn Gln Lys
260



<210> 224 

<211> 575 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 224 

Met Ser Asn Gly Gln Leu Ile Tyr Leu Met Val Ala Ile Ala Val Ile
1 5 10 15

Leu Val Leu Ala Tyr Val Val Ala Ile Phe Leu Arg Lys Arg Asn Glu
20 25 30

Gly Arg Leu Glu Ala Leu Glu Glu Arg Lys Glu Glu Leu Tyr Asn Leu
35 40 45

Pro Val Asn Asp Glu Val Glu Ala Val Lys Asn Met His Leu Ile Gly
50 55 60

Gln Ser Gln Val Ala Phe Arg Glu Trp Asn Gln Lys Trp Val Asp Leu
65 70 75 80

Ser Leu Asn Ser Phe Ala Asp Ile Glu Asn Asn Leu Phe Glu Ala Glu
85 90 95

Gly Tyr Asn His Ser Phe Arg Phe Leu Lys Ala Ser His Gln Ile Asp
100 105 110

Gln Ile Glu Ser Gln Ile Thr Leu Ile Glu Glu Asp Ile Ala Ala Ile
115 120 125

Arg Asn Ala Leu Ala Asp Leu Glu Lys Gln Glu Ser Lys Asn Ser Gly
130 135 140

Arg Val Leu His Ala Leu Asp Leu Phe Glu Glu Leu Gln His Arg Val
145 150 155 160

Ala Glu Asn Ser Glu Gln Tyr Gly Gln Ala Leu Asp Glu Ile Glu Lys
165 170 175

Gln Leu Glu Asn Ile Gln Ser Glu Phe Ser Gln Phe Val Thr Leu Asn
180 185 190

Ser Ser Gly Asp Pro Val Glu Ala Ala Val Ile Leu Asp Asn Thr Glu
195 200 205

Asn His Ile Leu Ala Leu Ser His Ile Val Asp Arg Val Pro Ala Leu
210 215 220

Val Thr Thr Leu Ser Thr Glu Leu Pro Asp Gln Leu Gln Asp Leu Glu
225 230 235 240

Ala Gly Tyr Arg Lys Leu Ile Asp Ala Asn Tyr His Phe Val Glu Thr
245 250 255

Asp Ile Glu Ala Arg Phe His Leu Leu Tyr Glu Ala Phe Lys Lys Asn
260 265 270

Gln Glu Asn Ile Arg Gln Leu Glu Leu Asp Asn Ala Glu Tyr Glu Asn
275 280 285

Gly Gln Ala Gln Glu Glu Ile Asn Ala Leu Tyr Asp Ile Phe Thr Arg
290 295 300

Glu Ile Ala Ala Gln Lys Val Val Glu Asn Leu Leu Ala Thr Leu Pro
305 310 315 320

Thr Tyr Leu Gln His Met Lys Glu Asn Asn Thr Leu Leu Gly Glu Asp
325 330 335

Ile Ala Arg Leu Asn Lys Thr Tyr Leu Leu Pro Glu Thr Ala Ala Ser
340 345 350

His Val Arg Arg Ile Gln Thr Glu Leu Glu Ser Phe Glu Ala Ala Ile
355 360 365

Val Glu Val Thr Ser Asn Gln Glu Glu Pro Thr Gln Ala Tyr Ser Val
370 375 380

Leu Glu Glu Asn Leu Glu Asp Leu Gln Thr Gln Leu Lys Asp Ile Glu
385 390 395 400

Asp Glu Gln Ile Ser Val Ser Glu Arg Leu Thr Gln Ile Glu Lys Asp
405 410 415

Asp Ile Asn Ala Arg Gln Lys Ala Asn Val Tyr Val Asn Arg Leu His
420 425 430



Thr Ile Lys Arg Tyr Met Glu Lys Arg Asn Leu Pro Gly Ile Pro Gln
435 440 445

Thr Phe Leu Lys Leu Phe Phe Thr Ala Ser Asn Asn Thr Glu Asp Leu
450 455 460

Met Val Glu Leu Glu Gln Lys Met Ile Asn Ile Glu Ser Val Thr Arg
465 470 475 480

Val Leu Glu Ile Ala Thr Asn Asp Met Glu Ala Leu Glu Thr Glu Thr
485 490 495

Tyr Asn Ile Val Gln Tyr Ala Thr Leu Thr Glu Gln Leu Leu Gln Tyr
500 505 510

Ser Asn Arg Tyr Arg Ser Phe Asp Glu Arg Ile Gln Glu Ala Phe Asn
515 520 525

Glu Ala Leu Asp Ile Phe Glu Lys Glu Phe Asp Tyr His Ala Ser Phe
530 535 540

Asp Lys Ile Ser Gln Ala Leu Glu Val Ala Glu Pro Gly Val Thr Asn
545 550 555 560

Arg Phe Val Thr Ser Tyr Glu Lys Thr Arg Glu Thr Ile Arg Phe
565 570 575



<210> 225 

<211> 800 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 225 

Met Leu Ile Ser Tyr Lys Trp Leu Lys Glu Leu Val Asp Ile Asp Val
1 5 10 15

Pro Ser Gln Glu Leu Ala Glu Lys Met Ser Thr Thr Gly Ile Glu Val
20 25 30

Glu Gly Val Glu Ser Pro Ala Ala Gly Leu Ser Lys Ile Val Val Gly
35 40 45

Glu Val Leu Ser Cys Glu Asp Val Pro Glu Thr His Leu His Val Cys
50 55 60

Gln Val Asn Val Gly Glu Glu Glu Arg Gln Ile Val Cys Gly Ala Pro
65 70 75 80

Asn Val Arg Ala Gly Ile Lys Val Met Val Ala Leu Pro Gly Ala Arg
85 90 95

Ile Ala Asp Asn Tyr Lys Ile Lys Lys Gly Lys Ile Arg Gly Leu Glu
100 105 110

Ser Leu Gly Met Ile Cys Ser Leu Gly Glu Leu Gly Ile Ser Asp Ser
115 120 125

Val Val Pro Lys Glu Phe Ala Asp Gly Ile Gln Ile Leu Pro Glu Asp
130 135 140

Ala Val Pro Gly Glu Glu Val Phe Ser Tyr Leu Asp Leu Asp Asp Glu
145 150 155 160

Ile Ile Glu Leu Ser Ile Thr Pro Asn Arg Ala Asp Ala Leu Ser Met
165 170 175

Cys Gly Val Ala His Glu Val Ala Ala Ile Tyr Asp Lys Ala Val Asn
180 185 190

Phe Lys Glu Phe Thr Leu Thr Glu Thr Asn Glu Ala Ala Ala Asp Ala
195 200 205

Leu Ser Val Ser Ile Glu Thr Asp Lys Ala Pro Tyr Tyr Ala Ala Arg
210 215 220

Ile Leu Asp Asn Val Thr Ile Ala Pro Ser Pro Gln Trp Leu Gln Asn
225 230 235 240



Leu Leu Met Asn Glu Gly Ile Arg Pro Ile Asn Asn Val Val Asp Val
245 250 255

Thr Asn Tyr Ile Leu Leu Tyr Phe Gly Gln Pro Met His Ala Phe Asp
260 265 270

Leu Asp Asn Phe Glu Gly Thr Asp Ile Arg Val Arg Glu Ala Arg Ala
275 280 285

Gly Glu Lys Leu Val Thr Leu Asp Gly Glu Glu Arg Asp Leu Asp Val
290 295 300

Asn Asp Leu Val Ile Thr Val Ala Asp Lys Pro Val Ala Leu Ala Gly
305 310 315 320

Val Met Gly Gly Gln Ala Thr Glu Ile Ser Glu Lys Ser Ser Arg Val
325 330 335

Val Leu Glu Ala Ala Val Phe Asn Gly Lys Ser Ile Arg Lys Thr Ser
340 345 350

Gly Arg Leu Asn Leu Arg Ser Glu Ser Ser Ser Arg Phe Glu Lys Gly
355 360 365

Ile Asn Val Ala Thr Val Asn Glu Ala Leu Asp Ala Ala Ala Ser Leu
370 375 380

Ile Ala Glu Leu Ala Gly Ala Thr Val Arg Lys Gly Ile Val Ser Ala
385 390 395 400

Gly Glu Leu Asp Thr Ser Asp Val Glu Val Ser Ser Thr Leu Ala Asp
405 410 415

Val Asn Arg Val Leu Gly Thr Glu Leu Ser Tyr Ala Asp Val Glu Asp
420 425 430

Val Phe Arg Arg Leu Gly Phe Gly Leu Ser Gly Asn Ala Asp Ser Phe
435 440 445

Thr Val Arg Val Pro Arg Arg Arg Trp Asp Ile Thr Ile Glu Ala Asp
450 455 460

Leu Phe Glu Glu Ile Ala Arg Ile Tyr Gly Tyr Asp Arg Leu Pro Thr
465 470 475 480

Ser Leu Pro Lys Asp Asp Gly Thr Ala Gly Glu Leu Thr Ala Thr Gln
485 490 495

Lys Leu Arg Arg Gln Val Arg Thr Ile Ala Glu Gly Ala Gly Leu Thr
500 505 510

Glu Ile Ile Thr Tyr Thr Leu Thr Thr Pro Glu Lys Ala Val Glu Phe
515 520 525



Thr Ala Gln Pro Ser Asn Leu Thr Glu Leu Met Trp Pro Met Thr Val
530 535 540

Asp Arg Ser Val Leu Arg Gln Asn Met Ile Ser Gly Ile Leu Asp Thr
545 550 555 560

Val Ala Tyr Asn Val Ala Arg Lys Asn Lys Asn Leu Ala Leu Tyr Glu
565 570 575

Ile Gly Lys Val Phe Glu Gln Thr Gly Asn Pro Lys Glu Glu Leu Pro
580 585 590

Asn Glu Ile Asn Ser Phe Ala Phe Ala Leu Thr Gly Leu Val Ala Glu
595 600 605

Lys Asp Phe Gln Thr Ala Ala Val Pro Val Asp Phe Phe Tyr Ala Lys
610 615 620

Gly Ile Leu Glu Ala Leu Phe Thr Arg Leu Gly Leu Gln Val Thr Tyr
625 630 635 640

Thr Ala Thr Ser Glu Ile Ala Ser Leu His Pro Gly Arg Thr Ala Val
645 650 655

Ile Ser Leu Gly Asp Gln Val Leu Gly Phe Leu Gly Gln Val His Pro
660 665 670

Val Thr Ala Lys Ala Tyr Asp Ile Pro Glu Thr Tyr Val Ala Glu Leu
675 680 685

Asn Leu Ser Ala Ile Glu Ala Ala Leu Gln Pro Ala Thr Pro Phe Val
690 695 700

Glu Ile Thr Lys Phe Pro Ala Val Ser Arg Asp Val Ala Leu Leu Leu
705 710 715 720

Lys Ala Glu Val Thr His Gln Glu Val Val Asp Ala Ile Gln Ala Ala
725 730 735

Gly Val Lys Arg Leu Thr Asp Ile Lys Leu Phe Asp Val Phe Ser Gly
740 745 750

Glu Lys Leu Gly Leu Gly Met Lys Ser Met Ala Tyr Ser Leu Thr Phe
755 760 765

Gln Asn Pro Glu Asp Ser Leu Thr Asp Glu Glu Val Ala Arg Tyr Met
770 775 780

Glu Lys Ile Gln Ala Ser Leu Glu Glu Lys Val Asn Ala Glu Val Arg
785 790 795 800



<210> 226 
<211> 180 
<212> PRT 
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<213> Streptococcus pneumoniae 
<400> 226 

Met Leu Glu Asn Asp Ile Lys Lys Val Leu Val Ser His Asp Glu Ile
1 5 10 15

Thr Glu Ala Ala Lys Lys Leu Gly Ala Gln Leu Thr Lys Asp Tyr Ala
20 25 30

Gly Lys Asn Pro Ile Leu Val Gly Ile Leu Lys Gly Ser Ile Pro Phe
35 40 45

Met Ala Glu Leu Val Lys His Ile Asp Thr His Ile Glu Met Asp Phe
50 55 60

Met Met Val Ser Ser Tyr His Gly Gly Thr Ala Ser Ser Gly Val Ile
65 70 75 80

Asn Ile Lys Gln Asp Val Thr Gln Asp Ile Lys Gly Arg His Val Leu
85 90 95

Phe Val Glu Asp Ile Ile Asp Thr Gly Gln Thr Leu Lys Asn Leu Arg
100 105 110

Asp Met Phe Lys Ala Arg Glu Ala Ala Ser Val Lys Ile Ala Thr Leu
115 120 125

Leu Asp Lys Pro Glu Gly Arg Val Val Glu Ile Glu Ala Asp Tyr Thr
130 135 140

Cys Phe Thr Ile Pro Asn Glu Phe Val Val Gly Tyr Gly Leu Asp Tyr
145 150 155 160

Lys Glu Asn Tyr Arg Asn Leu Pro Tyr Ile Gly Val Leu Lys Glu Glu
165 170 175

Val Tyr Ser Asn
180
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